Dataset statistics
| Number of variables | 40 |
|---|---|
| Number of observations | 45379 |
| Missing cells | 67148 |
| Missing cells (%) | 3.7% |
| Total size in memory | 13.8 MiB |
| Average record size in memory | 320.0 B |
Variable types
| Text | 11 |
|---|---|
| Numeric | 29 |
Science Fiction has constant value "" | Constant |
History has constant value "" | Constant |
belongs_to_collection has 40890 (90.1%) missing values | Missing |
overview has 941 (2.1%) missing values | Missing |
tagline has 24980 (55.0%) missing values | Missing |
popularity is highly skewed (γ1 = 29.21581948) | Skewed |
return is highly skewed (γ1 = 138.3340992) | Skewed |
budget has 36493 (80.4%) zeros | Zeros |
TV Movie has 44612 (98.3%) zeros | Zeros |
Western has 44337 (97.7%) zeros | Zeros |
Documentary has 41458 (91.4%) zeros | Zeros |
Music has 43781 (96.5%) zeros | Zeros |
Foreign has 43758 (96.4%) zeros | Zeros |
War has 44056 (97.1%) zeros | Zeros |
Mystery has 42915 (94.6%) zeros | Zeros |
Science Fiction has 45379 (100.0%) zeros | Zeros |
History has 45379 (100.0%) zeros | Zeros |
Horror has 40708 (89.7%) zeros | Zeros |
Thriller has 37759 (83.2%) zeros | Zeros |
Crime has 41074 (90.5%) zeros | Zeros |
Action has 38785 (85.5%) zeros | Zeros |
Drama has 25123 (55.4%) zeros | Zeros |
Romance has 38646 (85.2%) zeros | Zeros |
Fantasy has 43066 (94.9%) zeros | Zeros |
Adventure has 41885 (92.3%) zeros | Zeros |
Family has 42611 (93.9%) zeros | Zeros |
Comedy has 32198 (71.0%) zeros | Zeros |
Animation has 43446 (95.7%) zeros | Zeros |
revenue has 37972 (83.7%) zeros | Zeros |
runtime has 1535 (3.4%) zeros | Zeros |
vote_average has 2947 (6.5%) zeros | Zeros |
vote_count has 2849 (6.3%) zeros | Zeros |
return has 39998 (88.1%) zeros | Zeros |
Reproduction
| Analysis started | 2023-06-29 22:28:30.182458 |
|---|---|
| Analysis finished | 2023-06-29 22:28:36.714495 |
| Duration | 6.53 seconds |
| Software version | ydata-profiling vv4.3.1 |
| Download configuration | config.json |
MISSING 
| Distinct | 1695 |
|---|---|
| Distinct (%) | 37.8% |
| Missing | 40890 |
| Missing (%) | 90.1% |
| Memory size | 354.6 KiB |
Length
| Max length | 184 |
|---|---|
| Median length | 167 |
| Mean length | 141.4900869 |
| Min length | 75 |
Characters and Unicode
| Total characters | 635149 |
|---|---|
| Distinct characters | 170 |
| Distinct categories | 13 ? |
| Distinct scripts | 7 ? |
| Distinct blocks | 8 ? |
Unique
| Unique | 390 ? |
|---|---|
| Unique (%) | 8.7% |
Sample
| 1st row | {'id': 10194, 'name': 'Toy Story Collection', 'poster_path': '/7G9915LfUQ2lVfwMEEhDsn3kT4B.jpg', 'backdrop_path': '/9FBwqcd9IRruEDUrTdcaafOMKUq.jpg'} |
|---|---|
| 2nd row | {'id': 119050, 'name': 'Grumpy Old Men Collection', 'poster_path': '/nLvUdqgPgm3F85NMCii9gVFUcet.jpg', 'backdrop_path': '/hypTnLot2z8wpFS7qwsQHW1uV8u.jpg'} |
| 3rd row | {'id': 96871, 'name': 'Father of the Bride Collection', 'poster_path': '/nts4iOmNnq7GNicycMJ9pSAn204.jpg', 'backdrop_path': '/7qwE57OVZmMJChBpLEbJEmzUydk.jpg'} |
| 4th row | {'id': 645, 'name': 'James Bond Collection', 'poster_path': '/HORpg5CSkmeQlAolx3bKMrKgfi.jpg', 'backdrop_path': '/6VcVl48kNKvdXOZfJPdarlUGOsk.jpg'} |
| 5th row | {'id': 117693, 'name': 'Balto Collection', 'poster_path': '/w0ZgH6Lgxt2bQYnf1ss74UvYftm.jpg', 'backdrop_path': '/9VM5LiJV0bGb1st1KyHA3cVnO2G.jpg'} |
| Value | Count | Frequency (%) |
| name | 4495 | 9.7% |
| id | 4489 | 9.7% |
| poster_path | 4489 | 9.7% |
| backdrop_path | 4489 | 9.7% |
| collection | 3744 | 8.1% |
| none | 1771 | 3.8% |
| the | 1146 | 2.5% |
| of | 230 | 0.5% |
| series | 147 | 0.3% |
| 139 | 0.3% | |
| Other values (6631) | 21071 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 59197 | 9.3% |
| 41722 | 6.6% | |
| p | 29065 | 4.6% |
| a | 25697 | 4.0% |
| o | 25031 | 3.9% |
| e | 24216 | 3.8% |
| t | 23190 | 3.7% |
| : | 18055 | 2.8% |
| n | 16720 | 2.6% |
| r | 15819 | 2.5% |
| Other values (160) | 356437 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 316950 | |
| Other Punctuation | 105717 | 16.6% |
| Uppercase Letter | 94998 | 15.0% |
| Decimal Number | 56923 | 9.0% |
| Space Separator | 41722 | 6.6% |
| Connector Punctuation | 8978 | 1.4% |
| Open Punctuation | 4824 | 0.8% |
| Close Punctuation | 4824 | 0.8% |
| Dash Punctuation | 162 | < 0.1% |
| Other Letter | 37 | < 0.1% |
| Other values (3) | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 29065 | 9.2% |
| a | 25697 | 8.1% |
| o | 25031 | 7.9% |
| e | 24216 | 7.6% |
| t | 23190 | 7.3% |
| n | 16720 | 5.3% |
| r | 15819 | 5.0% |
| i | 15328 | 4.8% |
| h | 14433 | 4.6% |
| d | 13697 | 4.3% |
| Other values (69) | 113754 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 7691 | 8.1% |
| N | 5092 | 5.4% |
| T | 4595 | 4.8% |
| S | 4188 | 4.4% |
| A | 3722 | 3.9% |
| M | 3695 | 3.9% |
| B | 3679 | 3.9% |
| D | 3679 | 3.9% |
| L | 3481 | 3.7% |
| G | 3459 | 3.6% |
| Other values (33) | 51717 |
Other Letter
| Value | Count | Frequency (%) |
| リ | 3 | 8.1% |
| い | 3 | 8.1% |
| 男 | 3 | 8.1% |
| は | 3 | 8.1% |
| つ | 3 | 8.1% |
| ら | 3 | 8.1% |
| ズ | 3 | 8.1% |
| よ | 3 | 8.1% |
| シ | 3 | 8.1% |
| 즈 | 2 | 5.4% |
| Other values (4) | 8 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 59197 | |
| : | 18055 | 17.1% |
| , | 13546 | 12.8% |
| . | 7379 | 7.0% |
| / | 7228 | 6.8% |
| " | 214 | 0.2% |
| & | 52 | < 0.1% |
| ! | 35 | < 0.1% |
| * | 4 | < 0.1% |
| ? | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6788 | |
| 2 | 6105 | |
| 3 | 5870 | |
| 4 | 5779 | |
| 5 | 5699 | |
| 9 | 5476 | |
| 8 | 5450 | |
| 6 | 5367 | |
| 7 | 5345 | |
| 0 | 5044 |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 4489 | |
| ( | 330 | 6.8% |
| [ | 5 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 4489 | |
| ) | 330 | 6.8% |
| ] | 5 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 160 | |
| – | 2 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 41722 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8978 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 9 |
Modifier Letter
| Value | Count | Frequency (%) |
| ー | 3 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 411534 | |
| Common | 223164 | |
| Cyrillic | 414 | 0.1% |
| Hiragana | 15 | < 0.1% |
| Hangul | 10 | < 0.1% |
| Katakana | 9 | < 0.1% |
| Han | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| p | 29065 | 7.1% |
| a | 25697 | 6.2% |
| o | 25031 | 6.1% |
| e | 24216 | 5.9% |
| t | 23190 | 5.6% |
| n | 16720 | 4.1% |
| r | 15819 | 3.8% |
| i | 15328 | 3.7% |
| h | 14433 | 3.5% |
| d | 13697 | 3.3% |
| Other values (70) | 208338 |
Cyrillic
| Value | Count | Frequency (%) |
| л | 48 | 11.6% |
| и | 41 | 9.9% |
| о | 37 | 8.9% |
| к | 30 | 7.2% |
| е | 27 | 6.5% |
| я | 25 | 6.0% |
| а | 17 | 4.1% |
| К | 16 | 3.9% |
| ц | 16 | 3.9% |
| р | 14 | 3.4% |
| Other values (32) | 143 |
Common
| Value | Count | Frequency (%) |
| ' | 59197 | |
| 41722 | ||
| : | 18055 | 8.1% |
| , | 13546 | 6.1% |
| _ | 8978 | 4.0% |
| . | 7379 | 3.3% |
| / | 7228 | 3.2% |
| 1 | 6788 | 3.0% |
| 2 | 6105 | 2.7% |
| 3 | 5870 | 2.6% |
| Other values (24) | 48296 |
Hiragana
| Value | Count | Frequency (%) |
| い | 3 | |
| は | 3 | |
| つ | 3 | |
| ら | 3 | |
| よ | 3 |
Hangul
| Value | Count | Frequency (%) |
| 즈 | 2 | |
| 리 | 2 | |
| 시 | 2 | |
| 식 | 2 | |
| 객 | 2 |
Katakana
| Value | Count | Frequency (%) |
| リ | 3 | |
| ズ | 3 | |
| シ | 3 |
Han
| Value | Count | Frequency (%) |
| 男 | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 634435 | |
| Cyrillic | 414 | 0.1% |
| None | 246 | < 0.1% |
| Hiragana | 15 | < 0.1% |
| Punctuation | 14 | < 0.1% |
| Katakana | 12 | < 0.1% |
| Hangul | 10 | < 0.1% |
| CJK | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 59197 | 9.3% |
| 41722 | 6.6% | |
| p | 29065 | 4.6% |
| a | 25697 | 4.1% |
| o | 25031 | 3.9% |
| e | 24216 | 3.8% |
| t | 23190 | 3.7% |
| : | 18055 | 2.8% |
| n | 16720 | 2.6% |
| r | 15819 | 2.5% |
| Other values (71) | 355723 |
Cyrillic
| Value | Count | Frequency (%) |
| л | 48 | 11.6% |
| и | 41 | 9.9% |
| о | 37 | 8.9% |
| к | 30 | 7.2% |
| е | 27 | 6.5% |
| я | 25 | 6.0% |
| а | 17 | 4.1% |
| К | 16 | 3.9% |
| ц | 16 | 3.9% |
| р | 14 | 3.4% |
| Other values (32) | 143 |
None
| Value | Count | Frequency (%) |
| é | 45 | |
| ä | 40 | |
| ô | 35 | |
| ò | 28 | |
| ö | 19 | |
| ó | 14 | 5.7% |
| ı | 14 | 5.7% |
| í | 9 | 3.7% |
| á | 4 | 1.6% |
| İ | 4 | 1.6% |
| Other values (19) | 34 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 9 | |
| … | 3 | 21.4% |
| – | 2 | 14.3% |
Katakana
| Value | Count | Frequency (%) |
| リ | 3 | |
| ー | 3 | |
| ズ | 3 | |
| シ | 3 |
Hiragana
| Value | Count | Frequency (%) |
| い | 3 | |
| は | 3 | |
| つ | 3 | |
| ら | 3 | |
| よ | 3 |
CJK
| Value | Count | Frequency (%) |
| 男 | 3 |
Hangul
| Value | Count | Frequency (%) |
| 즈 | 2 | |
| 리 | 2 | |
| 시 | 2 | |
| 식 | 2 | |
| 객 | 2 |
budget
Real number (ℝ)
ZEROS 
| Distinct | 1223 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4232324.568 |
| Minimum | 0 |
|---|---|
| Maximum | 380000000 |
| Zeros | 36493 |
| Zeros (%) | 80.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 25000000 |
| Maximum | 380000000 |
| Range | 380000000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 17439317.02 |
|---|---|
| Coefficient of variation (CV) | 4.120505584 |
| Kurtosis | 66.63900958 |
| Mean | 4232324.568 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.118579439 |
| Sum | 1.920586566 × 1011 |
| Variance | 3.04129778 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36493 | |
| 5000000 | 286 | 0.6% |
| 10000000 | 259 | 0.6% |
| 20000000 | 243 | 0.5% |
| 2000000 | 242 | 0.5% |
| 15000000 | 226 | 0.5% |
| 3000000 | 223 | 0.5% |
| 25000000 | 206 | 0.5% |
| 1000000 | 197 | 0.4% |
| 30000000 | 190 | 0.4% |
| Other values (1213) | 6814 | 15.0% |
| Value | Count | Frequency (%) |
| 0 | 36493 | |
| 1 | 25 | 0.1% |
| 2 | 14 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 380000000 | 1 | < 0.1% |
| 300000000 | 1 | < 0.1% |
| 280000000 | 1 | < 0.1% |
| 270000000 | 1 | < 0.1% |
| 260000000 | 3 |
TV Movie
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.01690209128 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 44612 |
| Zeros (%) | 98.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1289060773 |
|---|---|
| Coefficient of variation (CV) | 7.626634787 |
| Kurtosis | 54.18757167 |
| Mean | 0.01690209128 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.495677651 |
| Sum | 767 |
| Variance | 0.01661677676 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 44612 | |
| 1 | 767 | 1.7% |
| Value | Count | Frequency (%) |
| 0 | 44612 | |
| 1 | 767 | 1.7% |
| Value | Count | Frequency (%) |
| 1 | 767 | 1.7% |
| 0 | 44612 |
Western
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02296216312 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 44337 |
| Zeros (%) | 97.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1497845005 |
|---|---|
| Coefficient of variation (CV) | 6.523100621 |
| Kurtosis | 38.57778855 |
| Mean | 0.02296216312 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.369936287 |
| Sum | 1042 |
| Variance | 0.02243539658 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 44337 | |
| 1 | 1042 | 2.3% |
| Value | Count | Frequency (%) |
| 0 | 44337 | |
| 1 | 1042 | 2.3% |
| Value | Count | Frequency (%) |
| 1 | 1042 | 2.3% |
| 0 | 44337 |
Documentary
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.08640560612 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 41458 |
| Zeros (%) | 91.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2809651526 |
|---|---|
| Coefficient of variation (CV) | 3.251700499 |
| Kurtosis | 6.668767757 |
| Mean | 0.08640560612 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.944227207 |
| Sum | 3921 |
| Variance | 0.07894141695 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 41458 | |
| 1 | 3921 | 8.6% |
| Value | Count | Frequency (%) |
| 0 | 41458 | |
| 1 | 3921 | 8.6% |
| Value | Count | Frequency (%) |
| 1 | 3921 | 8.6% |
| 0 | 41458 |
Music
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.03521452654 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 43781 |
| Zeros (%) | 96.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.184323662 |
|---|---|
| Coefficient of variation (CV) | 5.234307545 |
| Kurtosis | 23.43658602 |
| Mean | 0.03521452654 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.043367238 |
| Sum | 1598 |
| Variance | 0.03397521236 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 43781 | |
| 1 | 1598 | 3.5% |
| Value | Count | Frequency (%) |
| 0 | 43781 | |
| 1 | 1598 | 3.5% |
| Value | Count | Frequency (%) |
| 1 | 1598 | 3.5% |
| 0 | 43781 |
Foreign
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.03572136892 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 43758 |
| Zeros (%) | 96.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1855966373 |
|---|---|
| Coefficient of variation (CV) | 5.195675389 |
| Kurtosis | 23.03416264 |
| Mean | 0.03572136892 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.003313647 |
| Sum | 1621 |
| Variance | 0.03444611179 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 43758 | |
| 1 | 1621 | 3.6% |
| Value | Count | Frequency (%) |
| 0 | 43758 | |
| 1 | 1621 | 3.6% |
| Value | Count | Frequency (%) |
| 1 | 1621 | 3.6% |
| 0 | 43758 |
War
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0291544547 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 44056 |
| Zeros (%) | 97.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1682411847 |
|---|---|
| Coefficient of variation (CV) | 5.770685351 |
| Kurtosis | 29.33346972 |
| Mean | 0.0291544547 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.597515243 |
| Sum | 1323 |
| Variance | 0.02830509622 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 44056 | |
| 1 | 1323 | 2.9% |
| Value | Count | Frequency (%) |
| 0 | 44056 | |
| 1 | 1323 | 2.9% |
| Value | Count | Frequency (%) |
| 1 | 1323 | 2.9% |
| 0 | 44056 |
Mystery
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05429824368 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 42915 |
| Zeros (%) | 94.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2266077581 |
|---|---|
| Coefficient of variation (CV) | 4.1733902 |
| Kurtosis | 13.47583475 |
| Mean | 0.05429824368 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.933858262 |
| Sum | 2464 |
| Variance | 0.05135107602 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 42915 | |
| 1 | 2464 | 5.4% |
| Value | Count | Frequency (%) |
| 0 | 42915 | |
| 1 | 2464 | 5.4% |
| Value | Count | Frequency (%) |
| 1 | 2464 | 5.4% |
| 0 | 42915 |
Science Fiction
Real number (ℝ)
CONSTANT  ZEROS 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0 |
| Minimum | 0 |
|---|---|
| Maximum | 0 |
| Zeros | 45379 |
| Zeros (%) | 100.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 0 |
| Range | 0 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0 |
|---|---|
| Coefficient of variation (CV) | nan |
| Kurtosis | 0 |
| Mean | 0 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0 |
| Sum | 0 |
| Variance | 0 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 0 | 45379 |
| Value | Count | Frequency (%) |
| 0 | 45379 |
| Value | Count | Frequency (%) |
| 0 | 45379 |
History
Real number (ℝ)
CONSTANT  ZEROS 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0 |
| Minimum | 0 |
|---|---|
| Maximum | 0 |
| Zeros | 45379 |
| Zeros (%) | 100.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 0 |
| Range | 0 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0 |
|---|---|
| Coefficient of variation (CV) | nan |
| Kurtosis | 0 |
| Mean | 0 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0 |
| Sum | 0 |
| Variance | 0 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 0 | 45379 |
| Value | Count | Frequency (%) |
| 0 | 45379 |
| Value | Count | Frequency (%) |
| 0 | 45379 |
Horror
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1029330748 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 40708 |
| Zeros (%) | 89.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3038747962 |
|---|---|
| Coefficient of variation (CV) | 2.952158933 |
| Kurtosis | 4.830458777 |
| Mean | 0.1029330748 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.613473911 |
| Sum | 4671 |
| Variance | 0.09233989175 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 40708 | |
| 1 | 4671 | 10.3% |
| Value | Count | Frequency (%) |
| 0 | 40708 | |
| 1 | 4671 | 10.3% |
| Value | Count | Frequency (%) |
| 1 | 4671 | 10.3% |
| 0 | 40708 |
Thriller
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1679190815 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 37759 |
| Zeros (%) | 83.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3737985322 |
|---|---|
| Coefficient of variation (CV) | 2.226063463 |
| Kurtosis | 1.157315265 |
| Mean | 0.1679190815 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.77686923 |
| Sum | 7620 |
| Variance | 0.1397253427 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37759 | |
| 1 | 7620 | 16.8% |
| Value | Count | Frequency (%) |
| 0 | 37759 | |
| 1 | 7620 | 16.8% |
| Value | Count | Frequency (%) |
| 1 | 7620 | 16.8% |
| 0 | 37759 |
Crime
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09486767007 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 41074 |
| Zeros (%) | 90.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2930353008 |
|---|---|
| Coefficient of variation (CV) | 3.088884766 |
| Kurtosis | 5.646564022 |
| Mean | 0.09486767007 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.765197129 |
| Sum | 4305 |
| Variance | 0.08586968752 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 41074 | |
| 1 | 4305 | 9.5% |
| Value | Count | Frequency (%) |
| 0 | 41074 | |
| 1 | 4305 | 9.5% |
| Value | Count | Frequency (%) |
| 1 | 4305 | 9.5% |
| 0 | 41074 |
Action
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1453095044 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 38785 |
| Zeros (%) | 85.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3524164996 |
|---|---|
| Coefficient of variation (CV) | 2.425281822 |
| Kurtosis | 2.052234811 |
| Mean | 0.1453095044 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.012993881 |
| Sum | 6594 |
| Variance | 0.1241973892 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38785 | |
| 1 | 6594 | 14.5% |
| Value | Count | Frequency (%) |
| 0 | 38785 | |
| 1 | 6594 | 14.5% |
| Value | Count | Frequency (%) |
| 1 | 6594 | 14.5% |
| 0 | 38785 |
Drama
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4463738734 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 25123 |
| Zeros (%) | 55.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.4971213981 |
|---|---|
| Coefficient of variation (CV) | 1.113688385 |
| Kurtosis | -1.9535354 |
| Mean | 0.4463738734 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.2157561127 |
| Sum | 20256 |
| Variance | 0.2471296844 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 25123 | |
| 1 | 20256 |
| Value | Count | Frequency (%) |
| 0 | 25123 | |
| 1 | 20256 |
| Value | Count | Frequency (%) |
| 1 | 20256 | |
| 0 | 25123 |
Romance
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1483725953 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 38646 |
| Zeros (%) | 85.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.355472858 |
|---|---|
| Coefficient of variation (CV) | 2.395812093 |
| Kurtosis | 1.914354668 |
| Mean | 0.1483725953 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.97845149 |
| Sum | 6733 |
| Variance | 0.1263609528 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38646 | |
| 1 | 6733 | 14.8% |
| Value | Count | Frequency (%) |
| 0 | 38646 | |
| 1 | 6733 | 14.8% |
| Value | Count | Frequency (%) |
| 1 | 6733 | 14.8% |
| 0 | 38646 |
Fantasy
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05097071333 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 43066 |
| Zeros (%) | 94.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2199403685 |
|---|---|
| Coefficient of variation (CV) | 4.315034147 |
| Kurtosis | 14.6745667 |
| Mean | 0.05097071333 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.08337115 |
| Sum | 2313 |
| Variance | 0.0483737657 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 43066 | |
| 1 | 2313 | 5.1% |
| Value | Count | Frequency (%) |
| 0 | 43066 | |
| 1 | 2313 | 5.1% |
| Value | Count | Frequency (%) |
| 1 | 2313 | 5.1% |
| 0 | 43066 |
Adventure
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0769959673 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 41885 |
| Zeros (%) | 92.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2665879863 |
|---|---|
| Coefficient of variation (CV) | 3.462362974 |
| Kurtosis | 8.072133676 |
| Mean | 0.0769959673 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.173606452 |
| Sum | 3494 |
| Variance | 0.07106915444 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 41885 | |
| 1 | 3494 | 7.7% |
| Value | Count | Frequency (%) |
| 0 | 41885 | |
| 1 | 3494 | 7.7% |
| Value | Count | Frequency (%) |
| 1 | 3494 | 7.7% |
| 0 | 41885 |
Family
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06099737764 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 42611 |
| Zeros (%) | 93.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2393281425 |
|---|---|
| Coefficient of variation (CV) | 3.923580844 |
| Kurtosis | 11.46050208 |
| Mean | 0.06099737764 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.668786854 |
| Sum | 2768 |
| Variance | 0.05727795978 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 42611 | |
| 1 | 2768 | 6.1% |
| Value | Count | Frequency (%) |
| 0 | 42611 | |
| 1 | 2768 | 6.1% |
| Value | Count | Frequency (%) |
| 1 | 2768 | 6.1% |
| 0 | 42611 |
Comedy
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2904647524 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 32198 |
| Zeros (%) | 71.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.4539818518 |
|---|---|
| Coefficient of variation (CV) | 1.562949886 |
| Kurtosis | -1.147862485 |
| Mean | 0.2904647524 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9231403505 |
| Sum | 13181 |
| Variance | 0.2060995218 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 32198 | |
| 1 | 13181 |
| Value | Count | Frequency (%) |
| 0 | 32198 | |
| 1 | 13181 |
| Value | Count | Frequency (%) |
| 1 | 13181 | |
| 0 | 32198 |
Animation
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.04259679587 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 43446 |
| Zeros (%) | 95.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2019485271 |
|---|---|
| Coefficient of variation (CV) | 4.740932338 |
| Kurtosis | 18.52260917 |
| Mean | 0.04259679587 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.530098545 |
| Sum | 1933 |
| Variance | 0.04078320758 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 43446 | |
| 1 | 1933 | 4.3% |
| Value | Count | Frequency (%) |
| 0 | 43446 | |
| 1 | 1933 | 4.3% |
| Value | Count | Frequency (%) |
| 1 | 1933 | 4.3% |
| 0 | 43446 |
genres
Text
| Distinct | 4066 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.6 KiB |
Length
| Max length | 264 |
|---|---|
| Median length | 225 |
| Mean length | 62.89292404 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2854018 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2365 ? |
|---|---|
| Unique (%) | 5.2% |
Sample
| 1st row | [{'id': 16, 'name': 'Animation'}, {'id': 35, 'name': 'Comedy'}, {'id': 10751, 'name': 'Family'}] |
|---|---|
| 2nd row | [{'id': 12, 'name': 'Adventure'}, {'id': 14, 'name': 'Fantasy'}, {'id': 10751, 'name': 'Family'}] |
| 3rd row | [{'id': 10749, 'name': 'Romance'}, {'id': 35, 'name': 'Comedy'}] |
| 4th row | [{'id': 35, 'name': 'Comedy'}, {'id': 18, 'name': 'Drama'}, {'id': 10749, 'name': 'Romance'}] |
| 5th row | [{'id': 35, 'name': 'Comedy'}] |
| Value | Count | Frequency (%) |
| id | 91045 | |
| name | 91045 | |
| drama | 20256 | 5.5% |
| 18 | 20256 | 5.5% |
| 35 | 13181 | 3.6% |
| comedy | 13181 | 3.6% |
| 53 | 7620 | 2.1% |
| thriller | 7620 | 2.1% |
| 10749 | 6733 | 1.8% |
| romance | 6733 | 1.8% |
| Other values (35) | 92705 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 546270 | |
| 324996 | 11.4% | |
| : | 182090 | 6.4% |
| a | 152861 | 5.4% |
| e | 146817 | 5.1% |
| m | 144142 | 5.1% |
| , | 139095 | 4.9% |
| i | 130713 | 4.6% |
| n | 126717 | 4.4% |
| d | 107720 | 3.8% |
| Other values (36) | 852597 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1058604 | |
| Other Punctuation | 867455 | |
| Space Separator | 324996 | 11.4% |
| Decimal Number | 234492 | 8.2% |
| Close Punctuation | 136424 | 4.8% |
| Open Punctuation | 136424 | 4.8% |
| Uppercase Letter | 95623 | 3.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 152861 | |
| e | 146817 | |
| m | 144142 | |
| i | 130713 | |
| n | 126717 | |
| d | 107720 | |
| r | 69076 | |
| o | 48533 | 4.6% |
| y | 28508 | 2.7% |
| c | 27978 | 2.6% |
| Other values (7) | 75539 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 24177 | |
| C | 17486 | |
| A | 12021 | |
| F | 9746 | |
| T | 8387 | 8.8% |
| R | 6733 | 7.0% |
| H | 6068 | 6.3% |
| M | 4829 | 5.1% |
| S | 3044 | 3.2% |
| W | 2365 | 2.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 45574 | |
| 8 | 39707 | |
| 5 | 24892 | |
| 3 | 23240 | |
| 7 | 22736 | |
| 0 | 21480 | |
| 9 | 18660 | |
| 2 | 17680 | 7.5% |
| 4 | 13108 | 5.6% |
| 6 | 7415 | 3.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 546270 | |
| : | 182090 | 21.0% |
| , | 139095 | 16.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 91045 | |
| ] | 45379 |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 91045 | |
| [ | 45379 |
Space Separator
| Value | Count | Frequency (%) |
| 324996 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1699791 | |
| Latin | 1154227 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 152861 | |
| e | 146817 | |
| m | 144142 | |
| i | 130713 | |
| n | 126717 | |
| d | 107720 | |
| r | 69076 | |
| o | 48533 | 4.2% |
| y | 28508 | 2.5% |
| c | 27978 | 2.4% |
| Other values (18) | 171162 |
Common
| Value | Count | Frequency (%) |
| ' | 546270 | |
| 324996 | ||
| : | 182090 | 10.7% |
| , | 139095 | 8.2% |
| } | 91045 | 5.4% |
| { | 91045 | 5.4% |
| 1 | 45574 | 2.7% |
| ] | 45379 | 2.7% |
| [ | 45379 | 2.7% |
| 8 | 39707 | 2.3% |
| Other values (8) | 149211 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2854018 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 546270 | |
| 324996 | 11.4% | |
| : | 182090 | 6.4% |
| a | 152861 | 5.4% |
| e | 146817 | 5.1% |
| m | 144142 | 5.1% |
| , | 139095 | 4.9% |
| i | 130713 | 4.6% |
| n | 126717 | 4.4% |
| d | 107720 | 3.8% |
| Other values (36) | 852597 |
id
Real number (ℝ)
| Distinct | 45349 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108029.979 |
| Minimum | 2 |
|---|---|
| Maximum | 469172 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 5351.3 |
| Q1 | 26386.5 |
| median | 59859 |
| Q3 | 156538 |
| 95-th percentile | 357170.8 |
| Maximum | 469172 |
| Range | 469170 |
| Interquartile range (IQR) | 130151.5 |
Descriptive statistics
| Standard deviation | 112166.7138 |
|---|---|
| Coefficient of variation (CV) | 1.038292471 |
| Kurtosis | 0.5594153218 |
| Mean | 108029.979 |
| Median Absolute Deviation (MAD) | 44419 |
| Skewness | 1.283008053 |
| Sum | 4902292415 |
| Variance | 1.258137168 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 141971 | 3 | < 0.1% |
| 23305 | 2 | < 0.1% |
| 168538 | 2 | < 0.1% |
| 109962 | 2 | < 0.1% |
| 119916 | 2 | < 0.1% |
| 97995 | 2 | < 0.1% |
| 159849 | 2 | < 0.1% |
| 84198 | 2 | < 0.1% |
| 132641 | 2 | < 0.1% |
| 99080 | 2 | < 0.1% |
| Other values (45339) | 45358 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 11 | 1 |
| Value | Count | Frequency (%) |
| 469172 | 1 | |
| 468707 | 1 | |
| 468343 | 1 | |
| 467731 | 1 | |
| 465044 | 1 |
| Distinct | 89 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Memory size | 354.6 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 90736 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
| Value | Count | Frequency (%) |
| en | 32204 | |
| fr | 2437 | 5.4% |
| it | 1528 | 3.4% |
| ja | 1350 | 3.0% |
| de | 1078 | 2.4% |
| es | 992 | 2.2% |
| ru | 822 | 1.8% |
| hi | 508 | 1.1% |
| ko | 444 | 1.0% |
| zh | 408 | 0.9% |
| Other values (79) | 3597 | 7.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 34529 | |
| n | 32912 | |
| r | 3630 | 4.0% |
| f | 2835 | 3.1% |
| i | 2388 | 2.6% |
| t | 2250 | 2.5% |
| a | 1840 | 2.0% |
| s | 1652 | 1.8% |
| j | 1351 | 1.5% |
| d | 1323 | 1.5% |
| Other values (16) | 6026 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 90736 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 34529 | |
| n | 32912 | |
| r | 3630 | 4.0% |
| f | 2835 | 3.1% |
| i | 2388 | 2.6% |
| t | 2250 | 2.5% |
| a | 1840 | 2.0% |
| s | 1652 | 1.8% |
| j | 1351 | 1.5% |
| d | 1323 | 1.5% |
| Other values (16) | 6026 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 90736 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 34529 | |
| n | 32912 | |
| r | 3630 | 4.0% |
| f | 2835 | 3.1% |
| i | 2388 | 2.6% |
| t | 2250 | 2.5% |
| a | 1840 | 2.0% |
| s | 1652 | 1.8% |
| j | 1351 | 1.5% |
| d | 1323 | 1.5% |
| Other values (16) | 6026 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 90736 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 34529 | |
| n | 32912 | |
| r | 3630 | 4.0% |
| f | 2835 | 3.1% |
| i | 2388 | 2.6% |
| t | 2250 | 2.5% |
| a | 1840 | 2.0% |
| s | 1652 | 1.8% |
| j | 1351 | 1.5% |
| d | 1323 | 1.5% |
| Other values (16) | 6026 | 6.6% |
overview
Text
MISSING 
| Distinct | 44235 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 941 |
| Missing (%) | 2.1% |
| Memory size | 354.6 KiB |
Length
| Max length | 1000 |
|---|---|
| Median length | 786 |
| Mean length | 323.2921374 |
| Min length | 1 |
Characters and Unicode
| Total characters | 14366456 |
|---|---|
| Distinct characters | 429 |
| Distinct categories | 25 ? |
| Distinct scripts | 13 ? |
| Distinct blocks | 21 ? |
Unique
| Unique | 44176 ? |
|---|---|
| Unique (%) | 99.4% |
Sample
| 1st row | Led by Woody, Andy's toys live happily in his room until Andy's birthday brings Buzz Lightyear onto the scene. Afraid of losing his place in Andy's heart, Woody plots against Buzz. But when circumstances separate Buzz and Woody from their owner, the duo eventually learns to put aside their differences. |
|---|---|
| 2nd row | When siblings Judy and Peter discover an enchanted board game that opens the door to a magical world, they unwittingly invite Alan -- an adult who's been trapped inside the game for 26 years -- into their living room. Alan's only hope for freedom is to finish the game, which proves risky as all three find themselves running from giant rhinoceroses, evil monkeys and other terrifying creatures. |
| 3rd row | A family wedding reignites the ancient feud between next-door neighbors and fishing buddies John and Max. Meanwhile, a sultry Italian divorcée opens a restaurant at the local bait shop, alarming the locals who worry she'll scare the fish away. But she's less interested in seafood than she is in cooking up a hot time with Max. |
| 4th row | Cheated on, mistreated and stepped on, the women are holding their breath, waiting for the elusive "good man" to break a string of less-than-stellar lovers. Friends and confidants Vannah, Bernie, Glo and Robin talk it all out, determined to find a better way to breathe. |
| 5th row | Just when George Banks has recovered from his daughter's wedding, he receives the news that she's pregnant ... and that George's wife, Nina, is expecting too. He was planning on selling their home, but that's a plan that -- like George -- will have to change with the arrival of both a grandchild and a kid of his own. |
| Value | Count | Frequency (%) |
| the | 138089 | 5.6% |
| a | 98898 | 4.0% |
| and | 75263 | 3.1% |
| to | 73327 | 3.0% |
| of | 69578 | 2.8% |
| in | 48145 | 2.0% |
| is | 36502 | 1.5% |
| his | 36165 | 1.5% |
| with | 23904 | 1.0% |
| her | 21485 | 0.9% |
| Other values (97093) | 1827488 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2406483 | ||
| e | 1363851 | 9.5% |
| a | 940546 | 6.5% |
| t | 934830 | 6.5% |
| i | 851558 | 5.9% |
| o | 829919 | 5.8% |
| n | 822640 | 5.7% |
| s | 767891 | 5.3% |
| r | 744333 | 5.2% |
| h | 600843 | 4.2% |
| Other values (419) | 4103562 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11150647 | |
| Space Separator | 2406521 | 16.8% |
| Uppercase Letter | 390984 | 2.7% |
| Other Punctuation | 312833 | 2.2% |
| Decimal Number | 42223 | 0.3% |
| Dash Punctuation | 36768 | 0.3% |
| Close Punctuation | 10100 | 0.1% |
| Open Punctuation | 10077 | 0.1% |
| Final Punctuation | 4556 | < 0.1% |
| Initial Punctuation | 882 | < 0.1% |
| Other values (15) | 865 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1363851 | |
| a | 940546 | 8.4% |
| t | 934830 | 8.4% |
| i | 851558 | 7.6% |
| o | 829919 | 7.4% |
| n | 822640 | 7.4% |
| s | 767891 | 6.9% |
| r | 744333 | 6.7% |
| h | 600843 | 5.4% |
| l | 478832 | 4.3% |
| Other values (142) | 2815404 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 42754 | 10.9% |
| T | 35969 | 9.2% |
| S | 31129 | 8.0% |
| M | 23955 | 6.1% |
| B | 23703 | 6.1% |
| C | 22804 | 5.8% |
| H | 19429 | 5.0% |
| W | 18653 | 4.8% |
| I | 16799 | 4.3% |
| D | 16311 | 4.2% |
| Other values (77) | 139478 |
Other Letter
| Value | Count | Frequency (%) |
| र | 6 | 4.8% |
| न | 6 | 4.8% |
| म | 5 | 4.0% |
| の | 4 | 3.2% |
| प | 3 | 2.4% |
| द | 3 | 2.4% |
| ద | 3 | 2.4% |
| अ | 3 | 2.4% |
| व | 2 | 1.6% |
| م | 2 | 1.6% |
| Other values (76) | 88 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 133443 | |
| . | 124802 | |
| ' | 31122 | 9.9% |
| " | 11661 | 3.7% |
| : | 3299 | 1.1% |
| ? | 2759 | 0.9% |
| ; | 2493 | 0.8% |
| ! | 1543 | 0.5% |
| / | 765 | 0.2% |
| & | 453 | 0.1% |
| Other values (12) | 493 | 0.2% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ి | 4 | |
| ் | 3 | |
| ్ | 3 | |
| ् | 3 | |
| ̈ | 3 | |
| ా | 2 | 6.1% |
| े | 2 | 6.1% |
| ं | 2 | 6.1% |
| ु | 2 | 6.1% |
| Other values (4) | 5 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9748 | |
| 0 | 8265 | |
| 9 | 6405 | |
| 2 | 4251 | |
| 5 | 2440 | 5.8% |
| 8 | 2379 | 5.6% |
| 3 | 2342 | 5.5% |
| 4 | 2176 | 5.2% |
| 7 | 2131 | 5.0% |
| 6 | 2086 | 4.9% |
Spacing Mark
| Value | Count | Frequency (%) |
| ा | 11 | |
| ी | 4 | 14.8% |
| ो | 3 | 11.1% |
| ు | 3 | 11.1% |
| ि | 2 | 7.4% |
| ு | 2 | 7.4% |
| ం | 1 | 3.7% |
| ி | 1 | 3.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 35245 | |
| – | 881 | 2.4% |
| — | 633 | 1.7% |
| ― | 5 | < 0.1% |
| ‐ | 4 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ® | 45 | |
| ™ | 14 | 21.9% |
| ¦ | 2 | 3.1% |
| ° | 2 | 3.1% |
| � | 1 | 1.6% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 20 | |
| + | 11 | |
| = | 6 | 15.0% |
| | | 2 | 5.0% |
| − | 1 | 2.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10024 | |
| [ | 50 | 0.5% |
| { | 2 | < 0.1% |
| „ | 1 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 317 | |
| £ | 10 | 3.0% |
| ₹ | 1 | 0.3% |
| € | 1 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2406483 | ||
| 36 | < 0.1% | |
| 2 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10048 | |
| ] | 50 | 0.5% |
| } | 2 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 3847 | |
| ” | 690 | 15.1% |
| » | 19 | 0.4% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 672 | |
| ‘ | 192 | 21.8% |
| « | 18 | 2.0% |
Control
| Value | Count | Frequency (%) |
| 106 | ||
| | 3 | 2.7% |
| | 1 | 0.9% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 25 | |
| ` | 12 | |
| ¯ | 1 | 2.6% |
Format
| Value | Count | Frequency (%) |
| | 31 | |
| | 20 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 8 | |
| ¹ | 8 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 19 |
Line Separator
| Value | Count | Frequency (%) |
| 7 |
Letter Number
| Value | Count | Frequency (%) |
| Ⅱ | 2 |
Paragraph Separator
| Value | Count | Frequency (%) |
| 2 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11536399 | |
| Common | 2824638 | 19.7% |
| Cyrillic | 4587 | < 0.1% |
| Greek | 648 | < 0.1% |
| Devanagari | 77 | < 0.1% |
| Telugu | 30 | < 0.1% |
| Hiragana | 20 | < 0.1% |
| Tamil | 19 | < 0.1% |
| Han | 10 | < 0.1% |
| Hangul | 9 | < 0.1% |
| Other values (3) | 19 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1363851 | |
| a | 940546 | 8.2% |
| t | 934830 | 8.1% |
| i | 851558 | 7.4% |
| o | 829919 | 7.2% |
| n | 822640 | 7.1% |
| s | 767891 | 6.7% |
| r | 744333 | 6.5% |
| h | 600843 | 5.2% |
| l | 478832 | 4.2% |
| Other values (132) | 3201156 |
Common
| Value | Count | Frequency (%) |
| 2406483 | ||
| , | 133443 | 4.7% |
| . | 124802 | 4.4% |
| - | 35245 | 1.2% |
| ' | 31122 | 1.1% |
| " | 11661 | 0.4% |
| ) | 10048 | 0.4% |
| ( | 10024 | 0.4% |
| 1 | 9748 | 0.3% |
| 0 | 8265 | 0.3% |
| Other values (71) | 43797 | 1.6% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 470 | 10.2% |
| е | 404 | 8.8% |
| а | 373 | 8.1% |
| н | 323 | 7.0% |
| и | 299 | 6.5% |
| т | 265 | 5.8% |
| р | 240 | 5.2% |
| с | 218 | 4.8% |
| в | 173 | 3.8% |
| л | 161 | 3.5% |
| Other values (46) | 1661 |
Greek
| Value | Count | Frequency (%) |
| α | 60 | 9.3% |
| ο | 55 | 8.5% |
| τ | 43 | 6.6% |
| ι | 36 | 5.6% |
| η | 36 | 5.6% |
| ν | 34 | 5.2% |
| ε | 31 | 4.8% |
| ρ | 31 | 4.8% |
| π | 30 | 4.6% |
| ς | 30 | 4.6% |
| Other values (33) | 262 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 11 | 14.3% |
| र | 6 | 7.8% |
| न | 6 | 7.8% |
| म | 5 | 6.5% |
| ी | 4 | 5.2% |
| ् | 3 | 3.9% |
| ो | 3 | 3.9% |
| प | 3 | 3.9% |
| द | 3 | 3.9% |
| अ | 3 | 3.9% |
| Other values (21) | 30 |
Hiragana
| Value | Count | Frequency (%) |
| の | 4 | |
| さ | 1 | 5.0% |
| ん | 1 | 5.0% |
| と | 1 | 5.0% |
| そ | 1 | 5.0% |
| め | 1 | 5.0% |
| ひ | 1 | 5.0% |
| ち | 1 | 5.0% |
| ず | 1 | 5.0% |
| か | 1 | 5.0% |
| Other values (7) | 7 |
Telugu
| Value | Count | Frequency (%) |
| ి | 4 | |
| ్ | 3 | |
| ు | 3 | |
| ద | 3 | |
| న | 2 | 6.7% |
| స | 2 | 6.7% |
| ా | 2 | 6.7% |
| మ | 2 | 6.7% |
| ర | 2 | 6.7% |
| బ | 1 | 3.3% |
| Other values (6) | 6 |
Tamil
| Value | Count | Frequency (%) |
| ் | 3 | |
| ம | 2 | |
| ர | 2 | |
| ு | 2 | |
| ப | 2 | |
| ன | 1 | 5.3% |
| வ | 1 | 5.3% |
| த | 1 | 5.3% |
| ஆ | 1 | 5.3% |
| ய | 1 | 5.3% |
| Other values (3) | 3 |
Han
| Value | Count | Frequency (%) |
| 俣 | 1 | |
| 界 | 1 | |
| 患 | 1 | |
| 者 | 1 | |
| 世 | 1 | |
| 水 | 1 | |
| 鬼 | 1 | |
| 見 | 1 | |
| 難 | 1 | |
| 海 | 1 |
Hangul
| Value | Count | Frequency (%) |
| 사 | 2 | |
| 회 | 1 | |
| 식 | 1 | |
| 주 | 1 | |
| 기 | 1 | |
| 찾 | 1 | |
| 랑 | 1 | |
| 첫 | 1 |
Thai
| Value | Count | Frequency (%) |
| ่ | 2 | |
| ง | 1 | |
| ร | 1 | |
| พ | 1 | |
| แ | 1 | |
| ี | 1 | |
| ส | 1 |
Arabic
| Value | Count | Frequency (%) |
| م | 2 | |
| ہ | 1 | |
| ت | 1 |
Inherited
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ̈ | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14348456 | |
| Punctuation | 7270 | 0.1% |
| None | 5932 | < 0.1% |
| Cyrillic | 4587 | < 0.1% |
| Devanagari | 77 | < 0.1% |
| Telugu | 30 | < 0.1% |
| Hiragana | 20 | < 0.1% |
| Tamil | 19 | < 0.1% |
| Letterlike Symbols | 14 | < 0.1% |
| CJK | 10 | < 0.1% |
| Other values (11) | 41 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2406483 | ||
| e | 1363851 | 9.5% |
| a | 940546 | 6.6% |
| t | 934830 | 6.5% |
| i | 851558 | 5.9% |
| o | 829919 | 5.8% |
| n | 822640 | 5.7% |
| s | 767891 | 5.4% |
| r | 744333 | 5.2% |
| h | 600843 | 4.2% |
| Other values (82) | 4085562 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 3847 | |
| – | 881 | 12.1% |
| ” | 690 | 9.5% |
| “ | 672 | 9.2% |
| — | 633 | 8.7% |
| … | 303 | 4.2% |
| ‘ | 192 | 2.6% |
| | 31 | 0.4% |
| 7 | 0.1% | |
| ― | 5 | 0.1% |
| Other values (4) | 9 | 0.1% |
None
| Value | Count | Frequency (%) |
| é | 1552 | |
| ä | 294 | 5.0% |
| á | 293 | 4.9% |
| ö | 250 | 4.2% |
| í | 243 | 4.1% |
| è | 209 | 3.5% |
| ü | 178 | 3.0% |
| ı | 165 | 2.8% |
| ó | 164 | 2.8% |
| ç | 158 | 2.7% |
| Other values (141) | 2426 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 470 | 10.2% |
| е | 404 | 8.8% |
| а | 373 | 8.1% |
| н | 323 | 7.0% |
| и | 299 | 6.5% |
| т | 265 | 5.8% |
| р | 240 | 5.2% |
| с | 218 | 4.8% |
| в | 173 | 3.8% |
| л | 161 | 3.5% |
| Other values (46) | 1661 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 14 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 11 | 14.3% |
| र | 6 | 7.8% |
| न | 6 | 7.8% |
| म | 5 | 6.5% |
| ी | 4 | 5.2% |
| ् | 3 | 3.9% |
| ो | 3 | 3.9% |
| प | 3 | 3.9% |
| द | 3 | 3.9% |
| अ | 3 | 3.9% |
| Other values (21) | 30 |
Alphabetic PF
| Value | Count | Frequency (%) |
| fi | 4 |
Hiragana
| Value | Count | Frequency (%) |
| の | 4 | |
| さ | 1 | 5.0% |
| ん | 1 | 5.0% |
| と | 1 | 5.0% |
| そ | 1 | 5.0% |
| め | 1 | 5.0% |
| ひ | 1 | 5.0% |
| ち | 1 | 5.0% |
| ず | 1 | 5.0% |
| か | 1 | 5.0% |
| Other values (7) | 7 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ̈ | 3 |
Telugu
| Value | Count | Frequency (%) |
| ి | 4 | |
| ్ | 3 | |
| ు | 3 | |
| ద | 3 | |
| న | 2 | 6.7% |
| స | 2 | 6.7% |
| ా | 2 | 6.7% |
| మ | 2 | 6.7% |
| ర | 2 | 6.7% |
| బ | 1 | 3.3% |
| Other values (6) | 6 |
Tamil
| Value | Count | Frequency (%) |
| ் | 3 | |
| ம | 2 | |
| ர | 2 | |
| ு | 2 | |
| ப | 2 | |
| ன | 1 | 5.3% |
| வ | 1 | 5.3% |
| த | 1 | 5.3% |
| ஆ | 1 | 5.3% |
| ய | 1 | 5.3% |
| Other values (3) | 3 |
Arabic
| Value | Count | Frequency (%) |
| م | 2 | |
| ہ | 1 | |
| ت | 1 |
Hangul
| Value | Count | Frequency (%) |
| 사 | 2 | |
| 회 | 1 | |
| 식 | 1 | |
| 주 | 1 | |
| 기 | 1 | |
| 찾 | 1 | |
| 랑 | 1 | |
| 첫 | 1 |
Number Forms
| Value | Count | Frequency (%) |
| Ⅱ | 2 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 2 |
Thai
| Value | Count | Frequency (%) |
| ่ | 2 | |
| ง | 1 | |
| ร | 1 | |
| พ | 1 | |
| แ | 1 | |
| ี | 1 | |
| ส | 1 |
CJK
| Value | Count | Frequency (%) |
| 俣 | 1 | |
| 界 | 1 | |
| 患 | 1 | |
| 者 | 1 | |
| 世 | 1 | |
| 水 | 1 | |
| 鬼 | 1 | |
| 見 | 1 | |
| 難 | 1 | |
| 海 | 1 |
Math Operators
| Value | Count | Frequency (%) |
| − | 1 |
Katakana
| Value | Count | Frequency (%) |
| ・ | 1 |
Currency Symbols
| Value | Count | Frequency (%) |
| ₹ | 1 | |
| € | 1 |
Specials
| Value | Count | Frequency (%) |
| � | 1 |
popularity
Real number (ℝ)
SKEWED 
| Distinct | 43734 |
|---|---|
| Distinct (%) | 96.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.926356276 |
| Minimum | 0 |
|---|---|
| Maximum | 547.488298 |
| Zeros | 40 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.0208069 |
| Q1 | 0.388835 |
| median | 1.130503 |
| Q3 | 3.6906865 |
| 95-th percentile | 11.063588 |
| Maximum | 547.488298 |
| Range | 547.488298 |
| Interquartile range (IQR) | 3.3018515 |
Descriptive statistics
| Standard deviation | 6.009491011 |
|---|---|
| Coefficient of variation (CV) | 2.053574631 |
| Kurtosis | 1923.794721 |
| Mean | 2.926356276 |
| Median Absolute Deviation (MAD) | 0.967653 |
| Skewness | 29.21581948 |
| Sum | 132795.1215 |
| Variance | 36.11398221 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 × 10-6 | 56 | 0.1% |
| 0.000308 | 42 | 0.1% |
| 0 | 40 | 0.1% |
| 0.00022 | 39 | 0.1% |
| 0.000578 | 38 | 0.1% |
| 0.001177 | 38 | 0.1% |
| 0.000844 | 38 | 0.1% |
| 0.002001 | 27 | 0.1% |
| 0.003013 | 21 | < 0.1% |
| 0.00353 | 19 | < 0.1% |
| Other values (43724) | 45021 |
| Value | Count | Frequency (%) |
| 0 | 40 | |
| 1 × 10-6 | 56 | |
| 2 × 10-6 | 6 | < 0.1% |
| 3 × 10-6 | 6 | < 0.1% |
| 4 × 10-6 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 547.488298 | 1 | |
| 294.337037 | 1 | |
| 287.253654 | 1 | |
| 228.032744 | 1 | |
| 213.849907 | 1 |
| Distinct | 22706 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.6 KiB |
Length
| Max length | 1252 |
|---|---|
| Median length | 954 |
| Mean length | 70.22193085 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3186601 |
|---|---|
| Distinct characters | 293 |
| Distinct categories | 15 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 6 ? |
Unique
| Unique | 20344 ? |
|---|---|
| Unique (%) | 44.8% |
Sample
| 1st row | [{'name': 'Pixar Animation Studios', 'id': 3}] |
|---|---|
| 2nd row | [{'name': 'TriStar Pictures', 'id': 559}, {'name': 'Teitler Film', 'id': 2550}, {'name': 'Interscope Communications', 'id': 10201}] |
| 3rd row | [{'name': 'Warner Bros.', 'id': 6194}, {'name': 'Lancaster Gate', 'id': 19464}] |
| 4th row | [{'name': 'Twentieth Century Fox Film Corporation', 'id': 306}] |
| 5th row | [{'name': 'Sandollar Productions', 'id': 5842}, {'name': 'Touchstone Pictures', 'id': 9195}] |
| Value | Count | Frequency (%) |
| name | 70543 | 17.6% |
| id | 70543 | 17.6% |
| 12640 | 3.2% | |
| films | 9455 | 2.4% |
| pictures | 9267 | 2.3% |
| productions | 9062 | 2.3% |
| film | 6680 | 1.7% |
| entertainment | 5155 | 1.3% |
| corporation | 2189 | 0.5% |
| company | 1769 | 0.4% |
| Other values (42189) | 203823 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 422849 | 13.3% |
| 355760 | 11.2% | |
| i | 177494 | 5.6% |
| e | 165206 | 5.2% |
| n | 160523 | 5.0% |
| a | 147699 | 4.6% |
| : | 141093 | 4.4% |
| m | 114823 | 3.6% |
| , | 107905 | 3.4% |
| d | 104017 | 3.3% |
| Other values (283) | 1289232 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1410442 | |
| Other Punctuation | 679997 | |
| Space Separator | 355760 | 11.2% |
| Decimal Number | 295733 | 9.3% |
| Uppercase Letter | 198999 | 6.2% |
| Open Punctuation | 120249 | 3.8% |
| Close Punctuation | 120248 | 3.8% |
| Dash Punctuation | 4331 | 0.1% |
| Math Symbol | 662 | < 0.1% |
| Other Letter | 140 | < 0.1% |
| Other values (5) | 40 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 177494 | |
| e | 165206 | |
| n | 160523 | |
| a | 147699 | |
| m | 114823 | |
| d | 104017 | |
| o | 85308 | 6.0% |
| r | 83559 | 5.9% |
| t | 83450 | 5.9% |
| s | 62678 | 4.4% |
| Other values (102) | 225685 |
Other Letter
| Value | Count | Frequency (%) |
| 스 | 9 | 6.4% |
| 트 | 8 | 5.7% |
| 인 | 6 | 4.3% |
| 주 | 5 | 3.6% |
| 먼 | 5 | 3.6% |
| 테 | 5 | 3.6% |
| 터 | 5 | 3.6% |
| 엔 | 5 | 3.6% |
| 픽 | 4 | 2.9% |
| 로 | 3 | 2.1% |
| Other values (62) | 85 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 27885 | |
| F | 26364 | |
| C | 20588 | 10.3% |
| M | 13363 | 6.7% |
| S | 11915 | 6.0% |
| E | 9747 | 4.9% |
| A | 9549 | 4.8% |
| T | 9360 | 4.7% |
| B | 9002 | 4.5% |
| G | 7813 | 3.9% |
| Other values (52) | 53413 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 422849 | |
| : | 141093 | 20.7% |
| , | 107905 | 15.9% |
| . | 5671 | 0.8% |
| " | 987 | 0.1% |
| & | 764 | 0.1% |
| / | 645 | 0.1% |
| ! | 36 | < 0.1% |
| % | 18 | < 0.1% |
| \ | 12 | < 0.1% |
| Other values (6) | 17 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 45080 | |
| 2 | 33554 | |
| 3 | 31848 | |
| 4 | 30679 | |
| 6 | 28097 | |
| 5 | 27812 | |
| 8 | 25847 | |
| 7 | 24559 | |
| 9 | 24359 | |
| 0 | 23898 |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 70542 | |
| [ | 45388 | |
| ( | 4318 | 3.6% |
| ( | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 70542 | |
| ] | 45388 | |
| ) | 4317 | 3.6% |
| ) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4329 | |
| – | 2 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 661 | |
| | | 1 | 0.2% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 23 | |
| ㈜ | 2 | 8.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 3 | |
| ’ | 3 |
Other Number
| Value | Count | Frequency (%) |
| ² | 1 | |
| ½ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 355760 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1609038 | |
| Common | 1577018 | |
| Cyrillic | 373 | < 0.1% |
| Hangul | 115 | < 0.1% |
| Greek | 31 | < 0.1% |
| Han | 26 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 177494 | |
| e | 165206 | 10.3% |
| n | 160523 | 10.0% |
| a | 147699 | 9.2% |
| m | 114823 | 7.1% |
| d | 104017 | 6.5% |
| o | 85308 | 5.3% |
| r | 83559 | 5.2% |
| t | 83450 | 5.2% |
| s | 62678 | 3.9% |
| Other values (99) | 424281 |
Hangul
| Value | Count | Frequency (%) |
| 스 | 9 | 7.8% |
| 트 | 8 | 7.0% |
| 인 | 6 | 5.2% |
| 주 | 5 | 4.3% |
| 먼 | 5 | 4.3% |
| 테 | 5 | 4.3% |
| 터 | 5 | 4.3% |
| 엔 | 5 | 4.3% |
| 픽 | 4 | 3.5% |
| 로 | 3 | 2.6% |
| Other values (43) | 60 |
Common
| Value | Count | Frequency (%) |
| ' | 422849 | |
| 355760 | ||
| : | 141093 | 8.9% |
| , | 107905 | 6.8% |
| { | 70542 | 4.5% |
| } | 70542 | 4.5% |
| [ | 45388 | 2.9% |
| ] | 45388 | 2.9% |
| 1 | 45080 | 2.9% |
| 2 | 33554 | 2.1% |
| Other values (36) | 238917 |
Cyrillic
| Value | Count | Frequency (%) |
| и | 34 | 9.1% |
| о | 28 | 7.5% |
| а | 26 | 7.0% |
| л | 22 | 5.9% |
| н | 20 | 5.4% |
| м | 19 | 5.1% |
| т | 17 | 4.6% |
| с | 16 | 4.3% |
| е | 16 | 4.3% |
| ь | 16 | 4.3% |
| Other values (36) | 159 |
Greek
| Value | Count | Frequency (%) |
| ο | 3 | 9.7% |
| ν | 3 | 9.7% |
| τ | 2 | 6.5% |
| ρ | 2 | 6.5% |
| ι | 2 | 6.5% |
| η | 2 | 6.5% |
| λ | 2 | 6.5% |
| Ε | 2 | 6.5% |
| Κ | 2 | 6.5% |
| α | 1 | 3.2% |
| Other values (10) | 10 |
Han
| Value | Count | Frequency (%) |
| 公 | 2 | 7.7% |
| 限 | 2 | 7.7% |
| 有 | 2 | 7.7% |
| 司 | 2 | 7.7% |
| 影 | 2 | 7.7% |
| 北 | 2 | 7.7% |
| 京 | 2 | 7.7% |
| 发 | 1 | 3.8% |
| 媒 | 1 | 3.8% |
| 传 | 1 | 3.8% |
| Other values (9) | 9 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3180377 | |
| None | 5706 | 0.2% |
| Cyrillic | 373 | < 0.1% |
| Hangul | 113 | < 0.1% |
| CJK | 26 | < 0.1% |
| Punctuation | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 422849 | 13.3% |
| 355760 | 11.2% | |
| i | 177494 | 5.6% |
| e | 165206 | 5.2% |
| n | 160523 | 5.0% |
| a | 147699 | 4.6% |
| : | 141093 | 4.4% |
| m | 114823 | 3.6% |
| , | 107905 | 3.4% |
| d | 104017 | 3.3% |
| Other values (78) | 1283008 |
None
| Value | Count | Frequency (%) |
| é | 3176 | |
| ó | 416 | 7.3% |
| á | 317 | 5.6% |
| í | 173 | 3.0% |
| ü | 154 | 2.7% |
| ñ | 150 | 2.6% |
| ô | 140 | 2.5% |
| ä | 137 | 2.4% |
| è | 136 | 2.4% |
| ö | 132 | 2.3% |
| Other values (75) | 775 | 13.6% |
Cyrillic
| Value | Count | Frequency (%) |
| и | 34 | 9.1% |
| о | 28 | 7.5% |
| а | 26 | 7.0% |
| л | 22 | 5.9% |
| н | 20 | 5.4% |
| м | 19 | 5.1% |
| т | 17 | 4.6% |
| с | 16 | 4.3% |
| е | 16 | 4.3% |
| ь | 16 | 4.3% |
| Other values (36) | 159 |
Hangul
| Value | Count | Frequency (%) |
| 스 | 9 | 8.0% |
| 트 | 8 | 7.1% |
| 인 | 6 | 5.3% |
| 주 | 5 | 4.4% |
| 먼 | 5 | 4.4% |
| 테 | 5 | 4.4% |
| 터 | 5 | 4.4% |
| 엔 | 5 | 4.4% |
| 픽 | 4 | 3.5% |
| 로 | 3 | 2.7% |
| Other values (42) | 58 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 | |
| – | 2 | |
| • | 1 | 16.7% |
CJK
| Value | Count | Frequency (%) |
| 公 | 2 | 7.7% |
| 限 | 2 | 7.7% |
| 有 | 2 | 7.7% |
| 司 | 2 | 7.7% |
| 影 | 2 | 7.7% |
| 北 | 2 | 7.7% |
| 京 | 2 | 7.7% |
| 发 | 1 | 3.8% |
| 媒 | 1 | 3.8% |
| 传 | 1 | 3.8% |
| Other values (9) | 9 |
| Distinct | 2390 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.6 KiB |
Length
| Max length | 1039 |
|---|---|
| Median length | 649 |
| Mean length | 53.28682871 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2418103 |
|---|---|
| Distinct characters | 65 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1765 ? |
|---|---|
| Unique (%) | 3.9% |
Sample
| 1st row | [{'iso_3166_1': 'US', 'name': 'United States of America'}] |
|---|---|
| 2nd row | [{'iso_3166_1': 'US', 'name': 'United States of America'}] |
| 3rd row | [{'iso_3166_1': 'US', 'name': 'United States of America'}] |
| 4th row | [{'iso_3166_1': 'US', 'name': 'United States of America'}] |
| 5th row | [{'iso_3166_1': 'US', 'name': 'United States of America'}] |
| Value | Count | Frequency (%) |
| iso_3166_1 | 49415 | |
| name | 49415 | |
| united | 25269 | |
| states | 21150 | |
| of | 21149 | |
| america | 21149 | |
| us | 21149 | |
| 6211 | 2.3% | |
| gb | 4092 | 1.5% |
| kingdom | 4092 | 1.5% |
| Other values (341) | 50133 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 395315 | |
| 227845 | 9.4% | |
| e | 130072 | 5.4% |
| a | 119915 | 5.0% |
| i | 107969 | 4.5% |
| _ | 98830 | 4.1% |
| 1 | 98830 | 4.1% |
| 6 | 98830 | 4.1% |
| : | 98830 | 4.1% |
| n | 96917 | 4.0% |
| Other values (55) | 944750 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 904537 | |
| Other Punctuation | 553817 | |
| Decimal Number | 247075 | 10.2% |
| Space Separator | 227845 | 9.4% |
| Uppercase Letter | 196411 | 8.1% |
| Connector Punctuation | 98830 | 4.1% |
| Close Punctuation | 94794 | 3.9% |
| Open Punctuation | 94794 | 3.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 130072 | |
| a | 119915 | |
| i | 107969 | |
| n | 96917 | |
| o | 78999 | |
| m | 78123 | |
| s | 74105 | |
| t | 72626 | |
| d | 34551 | 3.8% |
| r | 32493 | 3.6% |
| Other values (16) | 78767 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 48397 | |
| S | 46881 | |
| A | 25529 | |
| F | 8676 | 4.4% |
| R | 7993 | 4.1% |
| I | 7595 | 3.9% |
| G | 6922 | 3.5% |
| K | 6808 | 3.5% |
| B | 5858 | 3.0% |
| C | 5371 | 2.7% |
| Other values (16) | 26381 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 395315 | |
| : | 98830 | 17.8% |
| , | 59662 | 10.8% |
| " | 10 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 98830 | |
| 6 | 98830 | |
| 3 | 49415 |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 49415 | |
| ] | 45379 |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 49415 | |
| [ | 45379 |
Space Separator
| Value | Count | Frequency (%) |
| 227845 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 98830 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1317155 | |
| Latin | 1100948 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 130072 | |
| a | 119915 | |
| i | 107969 | |
| n | 96917 | 8.8% |
| o | 78999 | 7.2% |
| m | 78123 | 7.1% |
| s | 74105 | 6.7% |
| t | 72626 | 6.6% |
| U | 48397 | 4.4% |
| S | 46881 | 4.3% |
| Other values (42) | 246944 |
Common
| Value | Count | Frequency (%) |
| ' | 395315 | |
| 227845 | ||
| _ | 98830 | 7.5% |
| 1 | 98830 | 7.5% |
| 6 | 98830 | 7.5% |
| : | 98830 | 7.5% |
| , | 59662 | 4.5% |
| } | 49415 | 3.8% |
| { | 49415 | 3.8% |
| 3 | 49415 | 3.8% |
| Other values (3) | 90768 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2418103 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 395315 | |
| 227845 | 9.4% | |
| e | 130072 | 5.4% |
| a | 119915 | 5.0% |
| i | 107969 | 4.5% |
| _ | 98830 | 4.1% |
| 1 | 98830 | 4.1% |
| 6 | 98830 | 4.1% |
| : | 98830 | 4.1% |
| n | 96917 | 4.0% |
| Other values (55) | 944750 |
release_date
Text
| Distinct | 17334 |
|---|---|
| Distinct (%) | 38.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.6 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 453790 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8571 ? |
|---|---|
| Unique (%) | 18.9% |
Sample
| 1st row | 1995-10-30 |
|---|---|
| 2nd row | 1995-12-15 |
| 3rd row | 1995-12-22 |
| 4th row | 1995-12-22 |
| 5th row | 1995-02-10 |
| Value | Count | Frequency (%) |
| 2008-01-01 | 136 | 0.3% |
| 2009-01-01 | 121 | 0.3% |
| 2007-01-01 | 118 | 0.3% |
| 2005-01-01 | 111 | 0.2% |
| 2006-01-01 | 101 | 0.2% |
| 2002-01-01 | 96 | 0.2% |
| 2004-01-01 | 90 | 0.2% |
| 2001-01-01 | 84 | 0.2% |
| 2003-01-01 | 76 | 0.2% |
| 1997-01-01 | 69 | 0.2% |
| Other values (17324) | 44377 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 97607 | |
| - | 90758 | |
| 1 | 84059 | |
| 2 | 52808 | |
| 9 | 39777 | |
| 3 | 15435 | 3.4% |
| 8 | 15280 | 3.4% |
| 6 | 15021 | 3.3% |
| 5 | 14836 | 3.3% |
| 7 | 14290 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 363032 | |
| Dash Punctuation | 90758 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 97607 | |
| 1 | 84059 | |
| 2 | 52808 | |
| 9 | 39777 | |
| 3 | 15435 | 4.3% |
| 8 | 15280 | 4.2% |
| 6 | 15021 | 4.1% |
| 5 | 14836 | 4.1% |
| 7 | 14290 | 3.9% |
| 4 | 13919 | 3.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 90758 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 453790 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 97607 | |
| - | 90758 | |
| 1 | 84059 | |
| 2 | 52808 | |
| 9 | 39777 | |
| 3 | 15435 | 3.4% |
| 8 | 15280 | 3.4% |
| 6 | 15021 | 3.3% |
| 5 | 14836 | 3.3% |
| 7 | 14290 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 453790 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 97607 | |
| - | 90758 | |
| 1 | 84059 | |
| 2 | 52808 | |
| 9 | 39777 | |
| 3 | 15435 | 3.4% |
| 8 | 15280 | 3.4% |
| 6 | 15021 | 3.3% |
| 5 | 14836 | 3.3% |
| 7 | 14290 | 3.1% |
revenue
Real number (ℝ)
ZEROS 
| Distinct | 6863 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11229356.85 |
| Minimum | 0 |
|---|---|
| Maximum | 2787965087 |
| Zeros | 37972 |
| Zeros (%) | 83.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 48018458.8 |
| Maximum | 2787965087 |
| Range | 2787965087 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 64387893.11 |
|---|---|
| Coefficient of variation (CV) | 5.733889657 |
| Kurtosis | 237.0928809 |
| Mean | 11229356.85 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.25512449 |
| Sum | 5.095769846 × 1011 |
| Variance | 4.145800779 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37972 | |
| 12000000 | 20 | < 0.1% |
| 10000000 | 19 | < 0.1% |
| 11000000 | 19 | < 0.1% |
| 2000000 | 18 | < 0.1% |
| 6000000 | 17 | < 0.1% |
| 5000000 | 14 | < 0.1% |
| 8000000 | 13 | < 0.1% |
| 500000 | 13 | < 0.1% |
| 1 | 12 | < 0.1% |
| Other values (6853) | 7262 | 16.0% |
| Value | Count | Frequency (%) |
| 0 | 37972 | |
| 1 | 12 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 2787965087 | 1 | |
| 2068223624 | 1 | |
| 1845034188 | 1 | |
| 1519557910 | 1 | |
| 1513528810 | 1 |
runtime
Real number (ℝ)
ZEROS 
| Distinct | 353 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 246 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.1810427 |
| Minimum | 0 |
|---|---|
| Maximum | 1256 |
| Zeros | 1535 |
| Zeros (%) | 3.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 85 |
| median | 95 |
| Q3 | 107 |
| 95-th percentile | 138 |
| Maximum | 1256 |
| Range | 1256 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 38.34005304 |
|---|---|
| Coefficient of variation (CV) | 0.4070888572 |
| Kurtosis | 93.92956811 |
| Mean | 94.1810427 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 4.490833231 |
| Sum | 4250673 |
| Variance | 1469.959667 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 2549 | 5.6% |
| 0 | 1535 | 3.4% |
| 100 | 1470 | 3.2% |
| 95 | 1410 | 3.1% |
| 93 | 1214 | 2.7% |
| 96 | 1104 | 2.4% |
| 92 | 1079 | 2.4% |
| 94 | 1062 | 2.3% |
| 91 | 1055 | 2.3% |
| 88 | 1030 | 2.3% |
| Other values (343) | 31625 |
| Value | Count | Frequency (%) |
| 0 | 1535 | |
| 1 | 107 | 0.2% |
| 2 | 33 | 0.1% |
| 3 | 48 | 0.1% |
| 4 | 50 | 0.1% |
| Value | Count | Frequency (%) |
| 1256 | 1 | |
| 1140 | 2 | |
| 931 | 1 | |
| 925 | 1 | |
| 900 | 1 |
spoken_languages
Text
| Distinct | 1931 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.6 KiB |
Length
| Max length | 765 |
|---|---|
| Median length | 40 |
| Mean length | 46.98891558 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2132310 |
|---|---|
| Distinct characters | 184 |
| Distinct categories | 11 ? |
| Distinct scripts | 15 ? |
| Distinct blocks | 16 ? |
Unique
| Unique | 1366 ? |
|---|---|
| Unique (%) | 3.0% |
Sample
| 1st row | [{'iso_639_1': 'en', 'name': 'English'}] |
|---|---|
| 2nd row | [{'iso_639_1': 'en', 'name': 'English'}, {'iso_639_1': 'fr', 'name': 'Français'}] |
| 3rd row | [{'iso_639_1': 'en', 'name': 'English'}] |
| 4th row | [{'iso_639_1': 'en', 'name': 'English'}] |
| 5th row | [{'iso_639_1': 'en', 'name': 'English'}] |
| Value | Count | Frequency (%) |
| iso_639_1 | 53277 | |
| name | 53277 | |
| english | 28731 | |
| en | 28731 | |
| 4748 | 2.2% | |
| fr | 4194 | 1.9% |
| français | 4194 | 1.9% |
| deutsch | 2624 | 1.2% |
| de | 2624 | 1.2% |
| español | 2412 | 1.1% |
| Other values (203) | 33477 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 426216 | |
| 172910 | 8.1% | |
| n | 120549 | 5.7% |
| _ | 106554 | 5.0% |
| : | 106554 | 5.0% |
| s | 99179 | 4.7% |
| i | 94078 | 4.4% |
| e | 92705 | 4.3% |
| a | 75201 | 3.5% |
| , | 64943 | 3.0% |
| Other values (174) | 773421 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 771585 | |
| Other Punctuation | 598804 | |
| Decimal Number | 213134 | 10.0% |
| Space Separator | 172910 | 8.1% |
| Connector Punctuation | 106554 | 5.0% |
| Close Punctuation | 98656 | 4.6% |
| Open Punctuation | 98656 | 4.6% |
| Uppercase Letter | 46430 | 2.2% |
| Other Letter | 22194 | 1.0% |
| Spacing Mark | 1838 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 120549 | |
| s | 99179 | |
| i | 94078 | |
| e | 92705 | |
| a | 75201 | |
| o | 61230 | |
| m | 53989 | |
| l | 36035 | 4.7% |
| h | 33815 | 4.4% |
| g | 30514 | 4.0% |
| Other values (65) | 74290 |
Other Letter
| Value | Count | Frequency (%) |
| 語 | 1759 | 7.9% |
| 本 | 1759 | 7.9% |
| 日 | 1759 | 7.9% |
| 话 | 1263 | 5.7% |
| 州 | 946 | 4.3% |
| 普 | 790 | 3.6% |
| 通 | 790 | 3.6% |
| द | 707 | 3.2% |
| ह | 707 | 3.2% |
| न | 707 | 3.2% |
| Other values (46) | 11007 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 31200 | |
| F | 4196 | 9.0% |
| D | 2926 | 6.3% |
| P | 2677 | 5.8% |
| I | 2366 | 5.1% |
| N | 829 | 1.8% |
| L | 505 | 1.1% |
| M | 362 | 0.8% |
| T | 308 | 0.7% |
| Č | 284 | 0.6% |
| Other values (13) | 777 | 1.7% |
Spacing Mark
| Value | Count | Frequency (%) |
| ी | 707 | |
| ि | 707 | |
| ు | 136 | 7.4% |
| ி | 111 | 6.0% |
| া | 94 | 5.1% |
| ং | 47 | 2.6% |
| ਾ | 18 | 1.0% |
| ੀ | 18 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 426216 | |
| : | 106554 | 17.8% |
| , | 64943 | 10.8% |
| / | 1015 | 0.2% |
| ? | 50 | < 0.1% |
| \ | 26 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ् | 707 | |
| ִ | 430 | |
| ְ | 215 | 13.9% |
| ் | 111 | 7.2% |
| ె | 68 | 4.4% |
| ੰ | 18 | 1.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 53303 | |
| 3 | 53277 | |
| 6 | 53277 | |
| 1 | 53277 |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 53277 | |
| ] | 45379 |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 53277 | |
| [ | 45379 |
Space Separator
| Value | Count | Frequency (%) |
| 172910 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 106554 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1288714 | |
| Latin | 805626 | |
| Han | 10485 | 0.5% |
| Cyrillic | 10454 | 0.5% |
| Devanagari | 4242 | 0.2% |
| Arabic | 3344 | 0.2% |
| Hangul | 3252 | 0.2% |
| Hebrew | 1720 | 0.1% |
| Greek | 1704 | 0.1% |
| Thai | 1232 | 0.1% |
| Other values (5) | 1537 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 120549 | |
| s | 99179 | |
| i | 94078 | |
| e | 92705 | |
| a | 75201 | |
| o | 61230 | |
| m | 53989 | |
| l | 36035 | 4.5% |
| h | 33815 | 4.2% |
| E | 31200 | 3.9% |
| Other values (52) | 107645 |
Cyrillic
| Value | Count | Frequency (%) |
| с | 3211 | |
| к | 1734 | |
| и | 1679 | |
| й | 1615 | |
| у | 1564 | |
| а | 113 | 1.1% |
| р | 87 | 0.8% |
| У | 53 | 0.5% |
| ї | 53 | 0.5% |
| н | 53 | 0.5% |
| Other values (12) | 292 | 2.8% |
Common
| Value | Count | Frequency (%) |
| ' | 426216 | |
| 172910 | ||
| _ | 106554 | 8.3% |
| : | 106554 | 8.3% |
| , | 64943 | 5.0% |
| 9 | 53303 | 4.1% |
| } | 53277 | 4.1% |
| { | 53277 | 4.1% |
| 3 | 53277 | 4.1% |
| 6 | 53277 | 4.1% |
| Other values (6) | 145126 | 11.3% |
Arabic
| Value | Count | Frequency (%) |
| ا | 537 | |
| ر | 537 | |
| ب | 341 | |
| ل | 341 | |
| ة | 341 | |
| ع | 341 | |
| ي | 341 | |
| س | 141 | 4.2% |
| ی | 141 | 4.2% |
| ف | 141 | 4.2% |
| Other values (5) | 142 | 4.2% |
Han
| Value | Count | Frequency (%) |
| 語 | 1759 | |
| 本 | 1759 | |
| 日 | 1759 | |
| 话 | 1263 | |
| 州 | 946 | |
| 普 | 790 | |
| 通 | 790 | |
| 話 | 473 | 4.5% |
| 廣 | 473 | 4.5% |
| 广 | 473 | 4.5% |
Hebrew
| Value | Count | Frequency (%) |
| ִ | 430 | |
| ע | 215 | |
| ת | 215 | |
| י | 215 | |
| ר | 215 | |
| ְ | 215 | |
| ב | 215 |
Greek
| Value | Count | Frequency (%) |
| λ | 426 | |
| ε | 213 | |
| η | 213 | |
| ν | 213 | |
| ά | 213 | |
| κ | 213 | |
| ι | 213 |
Georgian
| Value | Count | Frequency (%) |
| უ | 33 | |
| თ | 33 | |
| ი | 33 | |
| ლ | 33 | |
| ქ | 33 | |
| ა | 33 | |
| რ | 33 |
Devanagari
| Value | Count | Frequency (%) |
| ी | 707 | |
| द | 707 | |
| ् | 707 | |
| ह | 707 | |
| न | 707 | |
| ि | 707 |
Hangul
| Value | Count | Frequency (%) |
| 선 | 542 | |
| 말 | 542 | |
| 한 | 542 | |
| 조 | 542 | |
| 어 | 542 | |
| 국 | 542 |
Thai
| Value | Count | Frequency (%) |
| า | 352 | |
| ษ | 176 | |
| ไ | 176 | |
| ท | 176 | |
| ย | 176 | |
| ภ | 176 |
Gurmukhi
| Value | Count | Frequency (%) |
| ਪ | 18 | |
| ੰ | 18 | |
| ਜ | 18 | |
| ਾ | 18 | |
| ਬ | 18 | |
| ੀ | 18 |
Telugu
| Value | Count | Frequency (%) |
| ు | 136 | |
| గ | 68 | |
| ల | 68 | |
| త | 68 | |
| ె | 68 |
Tamil
| Value | Count | Frequency (%) |
| ம | 111 | |
| ் | 111 | |
| ழ | 111 | |
| த | 111 | |
| ி | 111 |
Bengali
| Value | Count | Frequency (%) |
| া | 94 | |
| ং | 47 | |
| ল | 47 | |
| ব | 47 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2085510 | |
| CJK | 10485 | 0.5% |
| Cyrillic | 10454 | 0.5% |
| None | 10408 | 0.5% |
| Devanagari | 4242 | 0.2% |
| Arabic | 3344 | 0.2% |
| Hangul | 3252 | 0.2% |
| Hebrew | 1720 | 0.1% |
| Thai | 1232 | 0.1% |
| Tamil | 555 | < 0.1% |
| Other values (6) | 1108 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 426216 | |
| 172910 | 8.3% | |
| n | 120549 | 5.8% |
| _ | 106554 | 5.1% |
| : | 106554 | 5.1% |
| s | 99179 | 4.8% |
| i | 94078 | 4.5% |
| e | 92705 | 4.4% |
| a | 75201 | 3.6% |
| , | 64943 | 3.1% |
| Other values (52) | 726621 |
None
| Value | Count | Frequency (%) |
| ç | 4441 | |
| ñ | 2412 | |
| ê | 591 | 5.7% |
| λ | 426 | 4.1% |
| ý | 284 | 2.7% |
| Č | 284 | 2.7% |
| ü | 247 | 2.4% |
| ε | 213 | 2.0% |
| η | 213 | 2.0% |
| ν | 213 | 2.0% |
| Other values (10) | 1084 | 10.4% |
Cyrillic
| Value | Count | Frequency (%) |
| с | 3211 | |
| к | 1734 | |
| и | 1679 | |
| й | 1615 | |
| у | 1564 | |
| а | 113 | 1.1% |
| р | 87 | 0.8% |
| У | 53 | 0.5% |
| ї | 53 | 0.5% |
| н | 53 | 0.5% |
| Other values (12) | 292 | 2.8% |
CJK
| Value | Count | Frequency (%) |
| 語 | 1759 | |
| 本 | 1759 | |
| 日 | 1759 | |
| 话 | 1263 | |
| 州 | 946 | |
| 普 | 790 | |
| 通 | 790 | |
| 話 | 473 | 4.5% |
| 廣 | 473 | 4.5% |
| 广 | 473 | 4.5% |
Devanagari
| Value | Count | Frequency (%) |
| ी | 707 | |
| द | 707 | |
| ् | 707 | |
| ह | 707 | |
| न | 707 | |
| ि | 707 |
Hangul
| Value | Count | Frequency (%) |
| 선 | 542 | |
| 말 | 542 | |
| 한 | 542 | |
| 조 | 542 | |
| 어 | 542 | |
| 국 | 542 |
Arabic
| Value | Count | Frequency (%) |
| ا | 537 | |
| ر | 537 | |
| ب | 341 | |
| ل | 341 | |
| ة | 341 | |
| ع | 341 | |
| ي | 341 | |
| س | 141 | 4.2% |
| ی | 141 | 4.2% |
| ف | 141 | 4.2% |
| Other values (5) | 142 | 4.2% |
Hebrew
| Value | Count | Frequency (%) |
| ִ | 430 | |
| ע | 215 | |
| ת | 215 | |
| י | 215 | |
| ר | 215 | |
| ְ | 215 | |
| ב | 215 |
Thai
| Value | Count | Frequency (%) |
| า | 352 | |
| ษ | 176 | |
| ไ | 176 | |
| ท | 176 | |
| ย | 176 | |
| ภ | 176 |
Telugu
| Value | Count | Frequency (%) |
| ు | 136 | |
| గ | 68 | |
| ల | 68 | |
| త | 68 | |
| ె | 68 |
Tamil
| Value | Count | Frequency (%) |
| ம | 111 | |
| ் | 111 | |
| ழ | 111 | |
| த | 111 | |
| ி | 111 |
Bengali
| Value | Count | Frequency (%) |
| া | 94 | |
| ং | 47 | |
| ল | 47 | |
| ব | 47 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ế | 61 | |
| ệ | 61 |
Georgian
| Value | Count | Frequency (%) |
| უ | 33 | |
| თ | 33 | |
| ი | 33 | |
| ლ | 33 | |
| ქ | 33 | |
| ა | 33 | |
| რ | 33 |
Gurmukhi
| Value | Count | Frequency (%) |
| ਪ | 18 | |
| ੰ | 18 | |
| ਜ | 18 | |
| ਾ | 18 | |
| ਬ | 18 | |
| ੀ | 18 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 4 |
status
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 80 |
| Missing (%) | 0.2% |
| Memory size | 354.6 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 8.011722113 |
| Min length | 7 |
Characters and Unicode
| Total characters | 362923 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Released |
|---|---|
| 2nd row | Released |
| 3rd row | Released |
| 4th row | Released |
| 5th row | Released |
| Value | Count | Frequency (%) |
| released | 44939 | |
| rumored | 230 | 0.5% |
| production | 116 | 0.3% |
| post | 97 | 0.2% |
| in | 19 | < 0.1% |
| planned | 13 | < 0.1% |
| canceled | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 135062 | |
| d | 45299 | 12.5% |
| R | 45169 | 12.4% |
| s | 45036 | 12.4% |
| l | 44953 | 12.4% |
| a | 44953 | 12.4% |
| o | 559 | 0.2% |
| r | 346 | 0.1% |
| u | 346 | 0.1% |
| m | 230 | 0.1% |
| Other values (8) | 970 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 317392 | |
| Uppercase Letter | 45415 | 12.5% |
| Space Separator | 116 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 135062 | |
| d | 45299 | 14.3% |
| s | 45036 | 14.2% |
| l | 44953 | 14.2% |
| a | 44953 | 14.2% |
| o | 559 | 0.2% |
| r | 346 | 0.1% |
| u | 346 | 0.1% |
| m | 230 | 0.1% |
| t | 213 | 0.1% |
| Other values (3) | 395 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 45169 | |
| P | 226 | 0.5% |
| I | 19 | < 0.1% |
| C | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 116 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 362807 | |
| Common | 116 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 135062 | |
| d | 45299 | 12.5% |
| R | 45169 | 12.4% |
| s | 45036 | 12.4% |
| l | 44953 | 12.4% |
| a | 44953 | 12.4% |
| o | 559 | 0.2% |
| r | 346 | 0.1% |
| u | 346 | 0.1% |
| m | 230 | 0.1% |
| Other values (7) | 854 | 0.2% |
Common
| Value | Count | Frequency (%) |
| 116 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 362923 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 135062 | |
| d | 45299 | 12.5% |
| R | 45169 | 12.4% |
| s | 45036 | 12.4% |
| l | 44953 | 12.4% |
| a | 44953 | 12.4% |
| o | 559 | 0.2% |
| r | 346 | 0.1% |
| u | 346 | 0.1% |
| m | 230 | 0.1% |
| Other values (8) | 970 | 0.3% |
tagline
Text
MISSING 
| Distinct | 20270 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 24980 |
| Missing (%) | 55.0% |
| Memory size | 354.6 KiB |
Length
| Max length | 297 |
|---|---|
| Median length | 204 |
| Mean length | 46.99803912 |
| Min length | 1 |
Characters and Unicode
| Total characters | 958713 |
|---|---|
| Distinct characters | 170 |
| Distinct categories | 17 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 10 ? |
Unique
| Unique | 20164 ? |
|---|---|
| Unique (%) | 98.8% |
Sample
| 1st row | Roll the dice and unleash the excitement! |
|---|---|
| 2nd row | Still Yelling. Still Fighting. Still Ready for Love. |
| 3rd row | Friends are the people who let you be yourself... and never let you forget it. |
| 4th row | Just When His World Is Back To Normal... He's In For The Surprise Of His Life! |
| 5th row | A Los Angeles Crime Saga |
| Value | Count | Frequency (%) |
| the | 10998 | 6.3% |
| a | 6815 | 3.9% |
| of | 4405 | 2.5% |
| to | 3584 | 2.1% |
| is | 2796 | 1.6% |
| in | 2693 | 1.5% |
| and | 2682 | 1.5% |
| you | 2389 | 1.4% |
| 1582 | 0.9% | |
| for | 1523 | 0.9% |
| Other values (15100) | 134473 |
Most occurring characters
| Value | Count | Frequency (%) |
| 153689 | ||
| e | 94415 | 9.8% |
| t | 57269 | 6.0% |
| o | 56567 | 5.9% |
| a | 51474 | 5.4% |
| n | 47498 | 5.0% |
| i | 46037 | 4.8% |
| r | 44994 | 4.7% |
| s | 42362 | 4.4% |
| h | 37172 | 3.9% |
| Other values (160) | 327236 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 680493 | |
| Space Separator | 153689 | 16.0% |
| Uppercase Letter | 74995 | 7.8% |
| Other Punctuation | 44585 | 4.7% |
| Decimal Number | 2687 | 0.3% |
| Dash Punctuation | 1944 | 0.2% |
| Final Punctuation | 98 | < 0.1% |
| Open Punctuation | 56 | < 0.1% |
| Close Punctuation | 55 | < 0.1% |
| Currency Symbol | 37 | < 0.1% |
| Other values (7) | 74 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 94415 | |
| t | 57269 | 8.4% |
| o | 56567 | 8.3% |
| a | 51474 | 7.6% |
| n | 47498 | 7.0% |
| i | 46037 | 6.8% |
| r | 44994 | 6.6% |
| s | 42362 | 6.2% |
| h | 37172 | 5.5% |
| l | 30174 | 4.4% |
| Other values (43) | 172531 |
Other Letter
| Value | Count | Frequency (%) |
| வ | 1 | 2.9% |
| ன | 1 | 2.9% |
| 成 | 1 | 2.9% |
| 劇 | 1 | 2.9% |
| 熟 | 1 | 2.9% |
| த | 1 | 2.9% |
| ஆ | 1 | 2.9% |
| 時 | 1 | 2.9% |
| 舞 | 1 | 2.9% |
| 場 | 1 | 2.9% |
| Other values (24) | 24 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 10009 | 13.3% |
| A | 6874 | 9.2% |
| S | 5652 | 7.5% |
| H | 4402 | 5.9% |
| I | 4387 | 5.8% |
| E | 4306 | 5.7% |
| W | 3681 | 4.9% |
| O | 3478 | 4.6% |
| N | 3195 | 4.3% |
| L | 3194 | 4.3% |
| Other values (20) | 25817 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 26647 | |
| ! | 5784 | 13.0% |
| ' | 5674 | 12.7% |
| , | 4226 | 9.5% |
| ? | 1161 | 2.6% |
| " | 582 | 1.3% |
| … | 148 | 0.3% |
| : | 138 | 0.3% |
| & | 83 | 0.2% |
| * | 42 | 0.1% |
| Other values (7) | 100 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 802 | |
| 1 | 516 | |
| 2 | 299 | 11.1% |
| 3 | 208 | 7.7% |
| 9 | 208 | 7.7% |
| 5 | 168 | 6.3% |
| 4 | 140 | 5.2% |
| 6 | 121 | 4.5% |
| 7 | 121 | 4.5% |
| 8 | 104 | 3.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5 | |
| = | 5 | |
| | | 2 | 14.3% |
| ~ | 1 | 7.1% |
| − | 1 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1927 | |
| – | 9 | 0.5% |
| — | 8 | 0.4% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 82 | |
| ” | 15 | 15.3% |
| » | 1 | 1.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 14 | |
| ‘ | 4 | 21.1% |
| « | 1 | 5.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 49 | |
| [ | 7 | 12.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 48 | |
| ] | 7 | 12.7% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 | |
| ² | 1 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˌ | 1 | |
| ˈ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 153689 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 37 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ் | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 755488 | |
| Common | 203190 | 21.2% |
| Han | 21 | < 0.1% |
| Tamil | 5 | < 0.1% |
| Hiragana | 5 | < 0.1% |
| Katakana | 4 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 94415 | 12.5% |
| t | 57269 | 7.6% |
| o | 56567 | 7.5% |
| a | 51474 | 6.8% |
| n | 47498 | 6.3% |
| i | 46037 | 6.1% |
| r | 44994 | 6.0% |
| s | 42362 | 5.6% |
| h | 37172 | 4.9% |
| l | 30174 | 4.0% |
| Other values (73) | 247526 |
Common
| Value | Count | Frequency (%) |
| 153689 | ||
| . | 26647 | 13.1% |
| ! | 5784 | 2.8% |
| ' | 5674 | 2.8% |
| , | 4226 | 2.1% |
| - | 1927 | 0.9% |
| ? | 1161 | 0.6% |
| 0 | 802 | 0.4% |
| " | 582 | 0.3% |
| 1 | 516 | 0.3% |
| Other values (42) | 2182 | 1.1% |
Han
| Value | Count | Frequency (%) |
| 成 | 1 | 4.8% |
| 劇 | 1 | 4.8% |
| 熟 | 1 | 4.8% |
| 時 | 1 | 4.8% |
| 舞 | 1 | 4.8% |
| 場 | 1 | 4.8% |
| 版 | 1 | 4.8% |
| 蜜 | 1 | 4.8% |
| 最 | 1 | 4.8% |
| 后 | 1 | 4.8% |
| Other values (11) | 11 |
Tamil
| Value | Count | Frequency (%) |
| வ | 1 | |
| ் | 1 | |
| ன | 1 | |
| த | 1 | |
| ஆ | 1 |
Hiragana
| Value | Count | Frequency (%) |
| は | 1 | |
| し | 1 | |
| て | 1 | |
| い | 1 | |
| る | 1 |
Katakana
| Value | Count | Frequency (%) |
| ク | 1 | |
| ラ | 1 | |
| ナ | 1 | |
| ド | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 958283 | |
| Punctuation | 280 | < 0.1% |
| None | 110 | < 0.1% |
| CJK | 21 | < 0.1% |
| Tamil | 5 | < 0.1% |
| Hiragana | 5 | < 0.1% |
| Katakana | 4 | < 0.1% |
| IPA Ext | 2 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
| Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 153689 | ||
| e | 94415 | 9.9% |
| t | 57269 | 6.0% |
| o | 56567 | 5.9% |
| a | 51474 | 5.4% |
| n | 47498 | 5.0% |
| i | 46037 | 4.8% |
| r | 44994 | 4.7% |
| s | 42362 | 4.4% |
| h | 37172 | 3.9% |
| Other values (78) | 326806 |
Punctuation
| Value | Count | Frequency (%) |
| … | 148 | |
| ’ | 82 | |
| ” | 15 | 5.4% |
| “ | 14 | 5.0% |
| – | 9 | 3.2% |
| — | 8 | 2.9% |
| ‘ | 4 | 1.4% |
None
| Value | Count | Frequency (%) |
| é | 18 | |
| ä | 16 | |
| ö | 8 | 7.3% |
| á | 6 | 5.5% |
| ó | 6 | 5.5% |
| ü | 5 | 4.5% |
| í | 5 | 4.5% |
| ı | 5 | 4.5% |
| · | 4 | 3.6% |
| ć | 3 | 2.7% |
| Other values (26) | 34 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 2 |
Tamil
| Value | Count | Frequency (%) |
| வ | 1 | |
| ் | 1 | |
| ன | 1 | |
| த | 1 | |
| ஆ | 1 |
CJK
| Value | Count | Frequency (%) |
| 成 | 1 | 4.8% |
| 劇 | 1 | 4.8% |
| 熟 | 1 | 4.8% |
| 時 | 1 | 4.8% |
| 舞 | 1 | 4.8% |
| 場 | 1 | 4.8% |
| 版 | 1 | 4.8% |
| 蜜 | 1 | 4.8% |
| 最 | 1 | 4.8% |
| 后 | 1 | 4.8% |
| Other values (11) | 11 |
Katakana
| Value | Count | Frequency (%) |
| ク | 1 | |
| ラ | 1 | |
| ナ | 1 | |
| ド | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˌ | 1 | |
| ˈ | 1 |
Hiragana
| Value | Count | Frequency (%) |
| は | 1 | |
| し | 1 | |
| て | 1 | |
| い | 1 | |
| る | 1 |
Math Operators
| Value | Count | Frequency (%) |
| − | 1 |
title
Text
| Distinct | 42198 |
|---|---|
| Distinct (%) | 93.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.6 KiB |
Length
| Max length | 105 |
|---|---|
| Median length | 79 |
| Mean length | 16.70206483 |
| Min length | 1 |
Characters and Unicode
| Total characters | 757923 |
|---|---|
| Distinct characters | 287 |
| Distinct categories | 17 ? |
| Distinct scripts | 7 ? |
| Distinct blocks | 12 ? |
Unique
| Unique | 39870 ? |
|---|---|
| Unique (%) | 87.9% |
Sample
| 1st row | Toy Story |
|---|---|
| 2nd row | Jumanji |
| 3rd row | Grumpier Old Men |
| 4th row | Waiting to Exhale |
| 5th row | Father of the Bride Part II |
| Value | Count | Frequency (%) |
| the | 14556 | 10.7% |
| of | 4930 | 3.6% |
| a | 2241 | 1.6% |
| in | 1693 | 1.2% |
| and | 1631 | 1.2% |
| to | 1054 | 0.8% |
| 757 | 0.6% | |
| man | 666 | 0.5% |
| love | 664 | 0.5% |
| for | 601 | 0.4% |
| Other values (24354) | 107397 |
Most occurring characters
| Value | Count | Frequency (%) |
| 90833 | 12.0% | |
| e | 76254 | 10.1% |
| a | 48947 | 6.5% |
| o | 45672 | 6.0% |
| n | 40820 | 5.4% |
| r | 40022 | 5.3% |
| i | 39767 | 5.2% |
| t | 36724 | 4.8% |
| s | 29521 | 3.9% |
| h | 28522 | 3.8% |
| Other values (277) | 280841 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 534181 | |
| Uppercase Letter | 117274 | 15.5% |
| Space Separator | 90833 | 12.0% |
| Other Punctuation | 10490 | 1.4% |
| Decimal Number | 3850 | 0.5% |
| Dash Punctuation | 981 | 0.1% |
| Close Punctuation | 87 | < 0.1% |
| Open Punctuation | 85 | < 0.1% |
| Final Punctuation | 38 | < 0.1% |
| Other Letter | 25 | < 0.1% |
| Other values (7) | 79 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 76254 | |
| a | 48947 | |
| o | 45672 | 8.5% |
| n | 40820 | 7.6% |
| r | 40022 | 7.5% |
| i | 39767 | 7.4% |
| t | 36724 | 6.9% |
| s | 29521 | 5.5% |
| h | 28522 | 5.3% |
| l | 25926 | 4.9% |
| Other values (121) | 122006 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 16021 | |
| S | 10338 | 8.8% |
| M | 8034 | 6.9% |
| B | 7659 | 6.5% |
| C | 7165 | 6.1% |
| A | 6786 | 5.8% |
| D | 6335 | 5.4% |
| L | 5872 | 5.0% |
| H | 5170 | 4.4% |
| W | 5166 | 4.4% |
| Other values (65) | 38728 |
Other Letter
| Value | Count | Frequency (%) |
| چ | 2 | 8.0% |
| ه | 2 | 8.0% |
| ی | 2 | 8.0% |
| ک | 2 | 8.0% |
| 傳 | 1 | 4.0% |
| 空 | 1 | 4.0% |
| 時 | 1 | 4.0% |
| 狗 | 1 | 4.0% |
| 貓 | 1 | 4.0% |
| ª | 1 | 4.0% |
| Other values (11) | 11 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3718 | |
| ' | 2505 | |
| . | 1603 | |
| , | 1134 | 10.8% |
| ! | 647 | 6.2% |
| & | 458 | 4.4% |
| ? | 269 | 2.6% |
| / | 79 | 0.8% |
| * | 19 | 0.2% |
| # | 13 | 0.1% |
| Other values (8) | 45 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 861 | |
| 1 | 697 | |
| 0 | 616 | |
| 3 | 482 | |
| 9 | 230 | 6.0% |
| 4 | 229 | 5.9% |
| 5 | 225 | 5.8% |
| 7 | 193 | 5.0% |
| 8 | 161 | 4.2% |
| 6 | 156 | 4.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 17 | |
| × | 3 | 12.5% |
| ∞ | 1 | 4.2% |
| = | 1 | 4.2% |
| → | 1 | 4.2% |
| − | 1 | 4.2% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 12 | |
| ² | 3 | 15.8% |
| ³ | 2 | 10.5% |
| ⅓ | 1 | 5.3% |
| ⁴ | 1 | 5.3% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 | |
| ☆ | 2 | |
| ™ | 1 | 12.5% |
| ♡ | 1 | 12.5% |
| № | 1 | 12.5% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 18 | |
| ¢ | 2 | 9.5% |
| £ | 1 | 4.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 966 | |
| – | 15 | 1.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 82 | |
| ] | 5 | 5.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 80 | |
| [ | 5 | 5.9% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 37 | |
| ” | 1 | 2.6% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 | |
| “ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 90833 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Format
| Value | Count | Frequency (%) |
| | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 650940 | |
| Common | 106443 | 14.0% |
| Cyrillic | 346 | < 0.1% |
| Greek | 170 | < 0.1% |
| Arabic | 11 | < 0.1% |
| Katakana | 8 | < 0.1% |
| Han | 5 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 76254 | 11.7% |
| a | 48947 | 7.5% |
| o | 45672 | 7.0% |
| n | 40820 | 6.3% |
| r | 40022 | 6.1% |
| i | 39767 | 6.1% |
| t | 36724 | 5.6% |
| s | 29521 | 4.5% |
| h | 28522 | 4.4% |
| l | 25926 | 4.0% |
| Other values (107) | 238765 |
Common
| Value | Count | Frequency (%) |
| 90833 | ||
| : | 3718 | 3.5% |
| ' | 2505 | 2.4% |
| . | 1603 | 1.5% |
| , | 1134 | 1.1% |
| - | 966 | 0.9% |
| 2 | 861 | 0.8% |
| 1 | 697 | 0.7% |
| ! | 647 | 0.6% |
| 0 | 616 | 0.6% |
| Other values (50) | 2863 | 2.7% |
Cyrillic
| Value | Count | Frequency (%) |
| е | 32 | 9.2% |
| о | 32 | 9.2% |
| а | 29 | 8.4% |
| н | 24 | 6.9% |
| и | 23 | 6.6% |
| р | 22 | 6.4% |
| к | 17 | 4.9% |
| с | 15 | 4.3% |
| л | 14 | 4.0% |
| в | 14 | 4.0% |
| Other values (38) | 124 |
Greek
| Value | Count | Frequency (%) |
| α | 20 | 11.8% |
| ι | 14 | 8.2% |
| ο | 14 | 8.2% |
| τ | 9 | 5.3% |
| ά | 8 | 4.7% |
| λ | 8 | 4.7% |
| ρ | 8 | 4.7% |
| ν | 7 | 4.1% |
| ε | 6 | 3.5% |
| ς | 6 | 3.5% |
| Other values (32) | 70 |
Katakana
| Value | Count | Frequency (%) |
| テ | 1 | |
| ポ | 1 | |
| ィ | 1 | |
| ス | 1 | |
| タ | 1 | |
| ン | 1 | |
| ァ | 1 | |
| フ | 1 |
Arabic
| Value | Count | Frequency (%) |
| چ | 2 | |
| ه | 2 | |
| ی | 2 | |
| ک | 2 | |
| س | 1 | |
| ا | 1 | |
| ج | 1 |
Han
| Value | Count | Frequency (%) |
| 傳 | 1 | |
| 空 | 1 | |
| 時 | 1 | |
| 狗 | 1 | |
| 貓 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 756358 | |
| None | 1124 | 0.1% |
| Cyrillic | 346 | < 0.1% |
| Punctuation | 62 | < 0.1% |
| Arabic | 11 | < 0.1% |
| Katakana | 8 | < 0.1% |
| CJK | 5 | < 0.1% |
| Misc Symbols | 3 | < 0.1% |
| Letterlike Symbols | 2 | < 0.1% |
| Math Operators | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 90833 | 12.0% | |
| e | 76254 | 10.1% |
| a | 48947 | 6.5% |
| o | 45672 | 6.0% |
| n | 40820 | 5.4% |
| r | 40022 | 5.3% |
| i | 39767 | 5.3% |
| t | 36724 | 4.9% |
| s | 29521 | 3.9% |
| h | 28522 | 3.8% |
| Other values (76) | 279276 |
None
| Value | Count | Frequency (%) |
| é | 218 | |
| ä | 127 | 11.3% |
| ö | 55 | 4.9% |
| è | 53 | 4.7% |
| ô | 44 | 3.9% |
| ü | 39 | 3.5% |
| ó | 37 | 3.3% |
| á | 35 | 3.1% |
| ı | 35 | 3.1% |
| í | 33 | 2.9% |
| Other values (108) | 448 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 37 | |
| – | 15 | |
| … | 5 | 8.1% |
| | 2 | 3.2% |
| ‘ | 1 | 1.6% |
| ” | 1 | 1.6% |
| “ | 1 | 1.6% |
Cyrillic
| Value | Count | Frequency (%) |
| е | 32 | 9.2% |
| о | 32 | 9.2% |
| а | 29 | 8.4% |
| н | 24 | 6.9% |
| и | 23 | 6.6% |
| р | 22 | 6.4% |
| к | 17 | 4.9% |
| с | 15 | 4.3% |
| л | 14 | 4.0% |
| в | 14 | 4.0% |
| Other values (38) | 124 |
Arabic
| Value | Count | Frequency (%) |
| چ | 2 | |
| ه | 2 | |
| ی | 2 | |
| ک | 2 | |
| س | 1 | |
| ا | 1 | |
| ج | 1 |
Misc Symbols
| Value | Count | Frequency (%) |
| ☆ | 2 | |
| ♡ | 1 |
CJK
| Value | Count | Frequency (%) |
| 傳 | 1 | |
| 空 | 1 | |
| 時 | 1 | |
| 狗 | 1 | |
| 貓 | 1 |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 1 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 1 | |
| № | 1 |
Math Operators
| Value | Count | Frequency (%) |
| ∞ | 1 | |
| − | 1 |
Katakana
| Value | Count | Frequency (%) |
| テ | 1 | |
| ポ | 1 | |
| ィ | 1 | |
| ス | 1 | |
| タ | 1 | |
| ン | 1 | |
| ァ | 1 | |
| フ | 1 |
Arrows
| Value | Count | Frequency (%) |
| → | 1 |
vote_average
Real number (ℝ)
ZEROS 
| Distinct | 92 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.62407942 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 2947 |
| Zeros (%) | 6.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6.8 |
| 95-th percentile | 7.8 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 1.915380993 |
|---|---|
| Coefficient of variation (CV) | 0.3405679135 |
| Kurtosis | 2.542209793 |
| Mean | 5.62407942 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | -1.524484076 |
| Sum | 255215.1 |
| Variance | 3.66868435 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2947 | 6.5% |
| 6 | 2463 | 5.4% |
| 5 | 1998 | 4.4% |
| 7 | 1884 | 4.2% |
| 6.5 | 1722 | 3.8% |
| 6.3 | 1603 | 3.5% |
| 5.5 | 1381 | 3.0% |
| 5.8 | 1369 | 3.0% |
| 6.4 | 1350 | 3.0% |
| 6.7 | 1342 | 3.0% |
| Other values (82) | 27320 |
| Value | Count | Frequency (%) |
| 0 | 2947 | |
| 0.5 | 13 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 1 | 103 | 0.2% |
| 1.1 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 185 | |
| 9.8 | 1 | < 0.1% |
| 9.6 | 1 | < 0.1% |
| 9.5 | 18 | < 0.1% |
| 9.4 | 3 | < 0.1% |
vote_count
Real number (ℝ)
ZEROS 
| Distinct | 1820 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 110.0899315 |
| Minimum | 0 |
|---|---|
| Maximum | 14075 |
| Zeros | 2849 |
| Zeros (%) | 6.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 10 |
| Q3 | 34 |
| 95-th percentile | 434 |
| Maximum | 14075 |
| Range | 14075 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 491.7272948 |
|---|---|
| Coefficient of variation (CV) | 4.466596429 |
| Kurtosis | 150.9384834 |
| Mean | 110.0899315 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 10.44112465 |
| Sum | 4995771 |
| Variance | 241795.7324 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3243 | 7.1% |
| 2 | 3127 | 6.9% |
| 0 | 2849 | 6.3% |
| 3 | 2785 | 6.1% |
| 4 | 2478 | 5.5% |
| 5 | 2097 | 4.6% |
| 6 | 1747 | 3.8% |
| 7 | 1570 | 3.5% |
| 8 | 1359 | 3.0% |
| 9 | 1194 | 2.6% |
| Other values (1810) | 22930 |
| Value | Count | Frequency (%) |
| 0 | 2849 | |
| 1 | 3243 | |
| 2 | 3127 | |
| 3 | 2785 | |
| 4 | 2478 |
| Value | Count | Frequency (%) |
| 14075 | 1 | |
| 12269 | 1 | |
| 12114 | 1 | |
| 12000 | 1 | |
| 11444 | 1 |
release_year
Real number (ℝ)
| Distinct | 135 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1991.882236 |
| Minimum | 1874 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 1874 |
|---|---|
| 5-th percentile | 1941 |
| Q1 | 1978 |
| median | 2001 |
| Q3 | 2010 |
| 95-th percentile | 2015 |
| Maximum | 2020 |
| Range | 146 |
| Interquartile range (IQR) | 32 |
Descriptive statistics
| Standard deviation | 24.05498602 |
|---|---|
| Coefficient of variation (CV) | 0.01207651014 |
| Kurtosis | 0.8403296365 |
| Mean | 1991.882236 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -1.224939659 |
| Sum | 90389624 |
| Variance | 578.6423525 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 1975 | 4.4% |
| 2015 | 1905 | 4.2% |
| 2013 | 1889 | 4.2% |
| 2012 | 1723 | 3.8% |
| 2011 | 1667 | 3.7% |
| 2016 | 1604 | 3.5% |
| 2009 | 1586 | 3.5% |
| 2010 | 1501 | 3.3% |
| 2008 | 1473 | 3.2% |
| 2007 | 1320 | 2.9% |
| Other values (125) | 28736 |
| Value | Count | Frequency (%) |
| 1874 | 1 | |
| 1878 | 1 | |
| 1883 | 1 | |
| 1887 | 1 | |
| 1888 | 2 |
| Value | Count | Frequency (%) |
| 2020 | 1 | < 0.1% |
| 2018 | 5 | < 0.1% |
| 2017 | 532 | 1.2% |
| 2016 | 1604 | |
| 2015 | 1905 |
return
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 5232 |
|---|---|
| Distinct (%) | 11.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 659.9991483 |
| Minimum | 0 |
|---|---|
| Maximum | 12396383 |
| Zeros | 39998 |
| Zeros (%) | 88.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2.53534128 |
| Maximum | 12396383 |
| Range | 12396383 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 74690.82512 |
|---|---|
| Coefficient of variation (CV) | 113.1680629 |
| Kurtosis | 20674.32378 |
| Mean | 659.9991483 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 138.3340992 |
| Sum | 29950101.35 |
| Variance | 5578719356 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 39998 | |
| 1 | 20 | < 0.1% |
| 2 | 12 | < 0.1% |
| 4 | 11 | < 0.1% |
| 5 | 8 | < 0.1% |
| 3 | 7 | < 0.1% |
| 2.5 | 7 | < 0.1% |
| 1.333333333 | 7 | < 0.1% |
| 1.5 | 6 | < 0.1% |
| 7 | 4 | < 0.1% |
| Other values (5222) | 5299 | 11.7% |
| Value | Count | Frequency (%) |
| 0 | 39998 | |
| 5.217391304 × 10-7 | 1 | < 0.1% |
| 7.5 × 10-7 | 1 | < 0.1% |
| 9.375 × 10-7 | 1 | < 0.1% |
| 1.499133126 × 10-6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 12396383 | 1 | |
| 8500000 | 1 | |
| 4197476.625 | 1 | |
| 2755584 | 1 | |
| 1018619.283 | 1 |